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The test for Abenomics 


Japanese Prime Minister Shinzo Abe has been buoyed by election success, but he must show that 
his science policies take the opinions of researchers into account. 


Sunday's upper-house election was an impressive show of sup- 

port for a sitting prime minister in a country that tends to chew 
its political leaders up and spit them out in short order. Especially 
popular with voters are Prime Minister Shinzo Abe’s programmes for 
financial stimulus, which have started to breathe life into a country in 
a state of economic paralysis. 

So what does the re-election mean for research and science? Abe, 
who took office last September, has been relatively generous with the 
scientific purse strings so far, anda prime minister who stays in power 
for more than a year might bring some stability to vacillating science 
policies. For example, the country’s Council for Science and Technol- 
ogy Policy, once a strong body that set and streamlined science priori- 
ties, has in the past three years been reduced to rubber-stamping the 
decisions of others. Now it is set to resurge. 

Abe has political capital to spend. But he may yet choose to use it to 
push positions that are unpopular with both scientists and the public. 

First among them is his stand in favour of nuclear power. All polls 
show that most people in Japan do not want the country’s reactors — 
shut down after the 2011 Fukushima accident — to be restarted. Yet 
Abe is pushing for that. The prime minister has tried to press the idea 
that the dysfunctional nuclear regulatory system that exacerbated the 
Fukushima accident has been fixed, but events in the past few weeks 
have shown that neither Fukushima’s ominously steaming reactors nor 
the country’s regulatory system are yet working correctly. The latest 
event to draw outrage was the discovery that Fukushima’s operator, the 
Tokyo Electric Power Company, this year waited for more than a month 
before admitting that radioactive water from the plant was leaking into 
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the sea. The Japanese Nuclear Regulation Authority actually did its job, 
drawing public attention to the contamination weeks ago. But what 
power does the regulatory system have if the nuclear operator can be 
so slow to respond? Suspicious observers are already suggesting that it 
waited until after the election to confirm the bad news. 

Abe's support for science is also getting mixed reviews. His empha- 
sis on technology transfer has hit a popular chord with industrial- 
ists, as have his plans to merge biomedical research funding under a 

body inspired by the US National Institutes 


“For now, most of Health (NIH). But the planned Japanese 


scientists seethe NIHis quite different from its namesake — 
proposals as a it is intended to focus on applied science, 
threat.” and will be supervised mostly by Abe and 


bureaucrats, rather than by scientists (see 
Nature 499, 136-137; 2013). 

This could send science off course. Already, Japan has focused too 
optimistically on induced pluripotent stem cells, and funding is set 
to increase further. The country should give more thought to what 
science and clinical applications are likely to emerge. Scientists in other 
fields feel they are being shunned. Is the Japanese NIH going to pro- 
vide balance? It should, but Abe’s proposal does not promise to do so. 

A genuine copy of the US NIH — one that seeks and takes into 
account the opinions of scientists — could bea real boon in Japan. Yet 
for now, most scientists see the proposals as a threat, especially given 
that they are being implemented from the top down. 

As the election results show, Abe is doing something right. Intro- 
ducing more science and less politics to the operation of this NIH-like 
agency would be an early way to show that he can listen as well as lead. m 


Forensics fiasco 


Inconsistent standards and a lack of research 
investment have left UK legal science in chaos. 


for research, but even we might not say that reductions in cash 
for science could be allowing murderers and rapists to roam our 
streets. We don't have to: Andrew Miller has said it for us. 

Miller leads the combative but respected (and cross-party) science 
select committee in the UK House of Commons. The committee last 
week produced a damning report — its second in just over three years 
— on the state of forensic science in the country. In his alarming sound 
bite, Miller neatly summarized the need for urgent government action, 
including dedicated funds for research into better sleuthing methods. 


Te publication is steadfast in promoting the benefits of funding 


The British system is a perfect case study of a wider forensics malaise. 
The Forensic Science Service, which provided services to police forces 
across the nation, was subjected to a disastrous attempt at privatization 
before being closed in March 2012. Police laboratories have inconsistent 
standards, and private companies have been asked to fill the gap. 

The problems that Miller's committee identifies are long-standing. 
But this time the politicians have upped the rhetorical ante, expressing 
concern that the minister responsible for forensic science “appeared 
to have so little understanding of the subject”. 

The shortcomings in this field are not restricted to the United King- 
dom. In February, the US Department of Justice announced a new 
National Commission on Forensic Science that will develop guidance 
across the spectrum of forensics, from courtroom to laboratory, on 
matters such as professional codes. It is sorely needed: just last month, 
the Department of Justice announced that more than 2,000 crimi- 
nal cases were being reviewed because of problems with hair-sample 
analysis. Forensic science holds great power over the lives and liberty 
of individuals. Now it must reclaim its great responsibility. m 
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promises in science and technology, so the campaign documents 

from the centre-right Pakistan Muslim League stood out. In its 
bid for power earlier this year, this party of landowners and industrial- 
ists did something unexpected: it pledged to put science and technology 
at the heart of its governing strategy. 

It promised one million new high-tech jobs in the next five years. It 
talked of new funding agencies for biotechnology and nanotechnology, 
of new programmes in fuel cells and small satellites, and of revamping 
agricultural research. And it included a promise that the state would not 
interfere in the appointment of university vice-chancellors. 

If kept, the promises would mark a step in the right direction for 
Pakistan, helping it to shed its reliance on international aid and edge 
towards economic independence and stability. 
But they seemed a little too good to be true. 

The Muslim League is now in power after 
beating the ruling Pakistan People’s Party in 
May’s elections. And already, the glittering future 
it promised for research is in doubt. It is not too 
late for the pledges to be realized, and science can 
still play a pivotal part in the development of the 
country, but it will require strong and committed 
leadership to see the changes through. 

The poor performance of science in Pakistan 
is a reflection of the discipline’s traditionally 
low political standing. So often in the country’s 
66-year history, the science ministry has been 
accorded the status of backwater, or has been 
used as a political football at the whim of party 
leaders for reasons that have little to do with science or technology 
policy. In Pakistan, ‘science minister’ is not seen as a proper job. 

Sometimes, the post has even been left vacant. In the 1990s, the 
centre-left government of Prime Minister Benazir Bhutto, for exam- 
ple, chose not to appoint a dedicated science minister during her 
second term. At other times, appointments have been questionable. 
One former science minister, the landowner Syeda Abida Hussain, 
was censured last year by the Press Council of Pakistan for promoting 
smoking in an article for Newsweek Pakistan. 

The latest incumbent, Zahid Hamid, is an unpromising choice for 
science minster. The position was his reward from the Muslim League 
for switching political sides — and it is questionable whether he has the 
vision or the experience to effect the changes promised by the league. 

Hamid also faces a political challenge that could see him exit the 
ministry as quickly as he arrived. He once was loyal to the former mili- 
tary dictator General Pervez Musharraf, who is 
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Pakistan must seize the 
chance to revive its science 


The government of Nawaz Sharif needs to appoint a tough leader to deliver on 
its election promise of a research renaissance, says Ehsan Masood. 


place — people who have a record of building successful institutions 
at the interface of knowledge and commerce, despite the poor track 
record of Pakistan's science ministry. Any one of these would be well 
placed to turn the promises of the manifesto into reality. 

Science does have the ear of the new prime minister, Nawaz Sharif. 
The man who was instrumental in placing science at the heart of the 
league’s manifesto, Ahsan Iqbal, is minister for planning and devel- 
opment and a member of Sharif’s inner circle. Should Hamid depart, 
Iqbal would have a golden opportunity to push for a widely respected 
and heavy-hitting science minister. 

The country has had such a science minister before. Although most 
Pakistanis have nothing good to say about Musharraf, young and old are 
in awe of the science minister he appointed, the University of Karachi 
chemistry professor Atta-ur-Rahman. Rahman 
secured a large budget increase and spent much 
of the funds hiring senior faculty members and 
boosting early-career programmes. As Rahman's 
association with the general effectively rules out 
a return to a political role, who else is suitable? 

One candidate is plant geneticist Kauser Malik, 
founding director of the National Institute for Bio- 
technology and Genetic Engineering in Faisalabad 
and one-time chairman of the Pakistan Agricul- 
tural Research Council. Malik has experience as 
both bench scientist and high-level administrator. 
He also has plenty of international connections 
and a record of delivering results. Alternatively, 
the government could look to the next genera- 
tion and appoint Muhammad Iqbal Choudhary, 
director of the HE) Research Institute of Chemistry at the University 
of Karachi, which is among the largest such facilities in Asia. 

A wild card would be Asad Umar, former chief executive of Engro, 
which he grew from a mid-size company to one of the continent's larg- 
est conglomerates, spanning agrochemicals to energy. Young, cricket- 
mad and popular with the public, Umar quit his job to join former 
cricketer Imran Khan’s political party, Tehreek-e-Insaf, which in May 
came from nowhere to become the country’s third electoral force. 

Umar’s decision to leave business for politics is part of a bigger trend 
among younger, middle-class and highly networked voters to become 
politically active. Unusually for Pakistan, many queued for the best part 
of a day to cast their votes in May’s elections. Even if Hamid stays in his 
post, the government might consider appointing a heavy-hitter to take 
charge of the science-for-growth strategy, and have him or her report to 
Iqbal in the Planning Commission. That person could then get on with 
the task of dragging Pakistan into the twenty-first century. m 


Ehsan Masood is editor of Research Fortnight and teaches science 
policy at Imperial College London. 
e-mail: news@researchresearch.com 
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Selections from the 
scientific literature 


RESEARCH HIGHLIGHTS 


Bee trade spreads 
parasites 


The international trade in 
beehives distributes dangerous 
parasites around the world. 

William Hughes at 
the University of Sussex, 

UK, and his colleagues 
investigated 48 commercially 
produced hives of buff- 
tailed bumblebees (Bombus 
terrestris). Despite being 
certified as parasite-free 

by their producers, more 
than 75% of the hives 
carried microbial parasites. 
Uninfected bees that were 
fed faeces or pollen collected 
from the commercial hives 
subsequently tested positive 
for parasites and had shorter 
lifespans. 

Worldwide, more than one 
million bumblebee hives are 
shipped every year, and the 
authors suggest that imported 
hives could infect native 
pollinators. 

J. Appl. Ecol. http://dx.doi. 
org/10.1111/1365- 
2664.12134 (2013) 


Odd performance 
under pressure 


When squashed down, 

most materials expand 
outwards, but one class of 
‘metamaterials gets narrower 
instead. 

Katia Bertoldi at Harvard 
University in Cambridge, 
Massachusetts, and her 
colleagues crafted several 
structures using arrangements 
of spherical shells with 
regularly spaced holes. 
Whereas most 
structures are 
engineered to 
avoid mechanical 
instabilities, this 
metamaterial 
exploits them 


by buckling and folding to 
become narrower under 
compression before returning 
to its original shape after 


FISHERIES 


Hazard lights for turtles 


Nets that are lit by ultraviolet lights deter turtles but catch just 
as many fish as unlit nets. 

Vulnerable sea-turtle species are often caught 
unintentionally in nets set out by fisheries (pictured). 
Many commercial fish species cannot see ultraviolet light, 
but several types of sea turtle can, so a team of researchers 
led by John Wang at the University of Hawaii in Honolulu 
tested whether visual cues stopped turtles from being 
trapped. The group placed ultraviolet lights every 5 metres 
along net floatlines and compared catches to those from 
unlit nets. The lights reduced the capture rate of green sea 
turtles (Chelonia mydas) by 40% and, in separate studies in 
a commercial fishery, did not significantly alter the value or 
number of marketable fish caught. 

Biol. Lett. 9, 20130383 (2013) 


These reversible 
architectures could be useful 
in applications such as 
energy-absorbing materials 
or acoustic dampeners, the 
authors suggest. 

Adv. Mater. http:// 
dx.doi.org/10.1002/ 
adma.201301986 (2013) 


a the pressure is removed. 
X-ray images and 
numerical simulations 
showed how the holes 
in the structures (one 

example pictured) 
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distort under increasing strain. 


REMOTE SENSING 


Counting carbon 
from above 


An airborne method for 
measuring biomass in forests 
and other ecosystems is as 
accurate as ground-based 
studies and can be used to 
create a national carbon map. 

Gregory Asner at the 
Carnegie Institution 
for Science in Stanford, 
California, and his colleagues 
measured vegetation across 
2.4% of Panama using an 
aeroplane equipped with a 
laser-based surface-detection 
system. They calibrated results 
by comparing data with 228 
field plots and scaled up 
their analysis using satellite 
observations. 

This produced the first 
high-fidelity map of carbon 
stocks across an entire country. 
The map has an uncertainty of 
10% in areas measured from 
the plane, and 20% on average 
in other areas. The authors 
suggest that laser-based data 
can fill in gaps where there are 
no field data. 

Carbon Balance Mgmt. 8, 7 
(2013) 


IMMUNOLOGY 


Antibody tutors 
travel 


A group of specialized white 
blood cells migrates back and 
forth through lymph nodes 

to teach other immune cells 
how to make a greater variety 
of antibodies against invading 
bacteria and viruses. 

A team led by Michel 
Nussenzweig at the 
Rockefeller University in New 
York City and Gabriel Victora 
at the Whitehead Institute 
for Biomedical Research in 
Cambridge, Massachusetts, 
used fluorescence microscopy 
to monitor the behaviour of 
specialized T cells in lymph 
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nodes in live mice. These 

T cells guide other white blood 
cells, known as B cells, to 
manufacture antibodies. The 
team found, unexpectedly, 
that the specialized T cells 
continually travelled between 
the structures in which 

B cells mature. This constant 
movement exposes stationary 
B cells to diverse T cells 

and might result in B cells 
producing a wider range of 
pathogen-specific antibodies, 
the authors say. 

Science http://dx.doi. 
org/10.1126/science.1241680 
(2013) 


PALAEOANTHROPOLOGY 


Sex determination 
for the Stone Age 


A DNA-sequencing method 
reliably reveals the sex of 
ancient human remains. 

A team led by Pontus 
Skoglund at Uppsala 
University in Sweden analysed 
high-throughput-sequencing 
data from 14 modern humans 
and from the remains of 
16 individuals up to 70,000 
years old. By comparing 
the number of sequences 
that matched the X or Y 
chromosome, the team could 
identify males and females. 
This approach worked even 
when the sequence data were 
deliberately ‘contaminated’ 
by adding present-day DNA 
sequences of the opposite sex. 

The technique determined 
the sex of 14 out of 16 ancient 
individuals studied, including 
several Neanderthals and 
a 5,300-year-old mummy 
(pictured). Two Neolithic 
individuals who had been 
classified as females on the 
basis of their morphology were 
identified as males. 


The method could be useful 
for determining the sex of 
juvenile or highly degraded 
specimens, the researchers say. 
J. Archaeol. Sci. http:// 
dx.doi.org/10.1016/j. 
jas.2013.07.004 (2013) 


OPTICAL PHYSICS 


Images ona 
subatomic scale 


High-speed videos of nuclear 
reactions with frames lasting 
less than one-billionth of 
one-billionth of a second may 
eventually be possible, owing 
to a theoretical scheme for 
making ultrashort laser pulses. 
Longer pulses of light lasting 
several attoseconds (1078 
seconds) are already used 
to capture high-resolution 
films of atomic and electronic 
processes. The pulses are 
typically generated from 
X-rays that are emitted when 
electrons are scattered by 
infrared lasers and then 
recombine with parent atoms. 
Carlos Hernandez-Garcia 
of the University of Colorado 
Boulder and his colleagues 
suggest an even more sensitive 
technique that involves 
measuring the interference 
pattern between X-ray pulses 
emitted from electrons that 
have been scattered and 
recombined multiple times, 
potentially generating pulses 
that last only hundreds of 
zeptoseconds (1 zeptosecond 
is 10 seconds). Such a 
timescale could be used to 
image subatomic processes. 
Phys. Rev. Lett. 111,033002 
(2013) 


How mussels flex 
for impact 


The thin threads that attach 
mussels to rocks and fast- 
moving ships have the perfect 
ratio of components to absorb 
the battering forces of the sea. 
Civil engineers Markus 
Buehler and Zhao Qin of the 
Massachusetts Institute of 
Technology in Cambridge 
measured the strength of 
the mollusc’s ‘byssal threads, 
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COMMUNITY 


CHOICE 


The most viewed 


papers in science 


Mice screened gene by gene 


3 HIGHLY READ 
on www.cellpress 
com in July 


A systematic screen of engineered mouse 
lines found that mutations in previously 
unstudied genes were just as likely to 


produce effects as those in genes that had 
already been explored in other scientific publications. 

Karen Steel at the Wellcome Trust Sanger Institute in 
Cambridge, UK, and her colleagues analysed 489 genes by 
systematically disabling each one in a separate mouse line. 
More than 40% of these genes were found to be essential for 


survival. 


Adult mice from 250 of the lines were run through a wide 
range of tests, revealing traits such as reduced grip strength, 
fused joints, delayed response to heat, impaired hearing and 


altered blood chemistry. 


Many results were unexpected: for example, disabling a 
gene that is thought to be involved in hearing led instead 
to increased body weight. Both the data and mutants are 
publicly available, and multiple groups are working together 
to perform similar tests on thousands more genes. 


Cell 154, 452-464 (2013) 


secreted protein fibres that 
extend from its shell to a hard 
surface. Using computer 
simulations, they found 

that the 4:1 ratio of soft 

to stiff material in natural 
threads minimizes the force 
experienced where threads 
join the shell, allowing the 
attachment to withstand 
crashing wave forces up to 
nine times the strength of the 
threads as measured with a 
stationary load. 

The authors suggest that the 
insights could help researchers 
to design earthquake-resistant 
buildings, which, like mussels, 
must withstand fluctuating 
forces. 

Nature Commun. 4, 2187 
(2013) 


PHYSIOLOGY 


Temperature fits 
the bill 


In baby birds, ambient 
temperature can set beak size. 
Gary Burness of Trent 
University in Peterborough, 
Canada, and his colleagues 
raised 40 Japanese quail 
hatchlings (Coturnix 


japonica; 


pictured) 
at high 
(30°C) or low 

(15°C) temperatures. After 
51 days, the hatchlings raised 
in warm conditions had 
longer bills than the chilled 
chicks. Beak growth in cold- 
reared chicks caught up after 
all birds were moved to an 
intermediate temperature, 
but the surface temperature 
of their beaks was, on 
average, 7.4°C lower than 
that of birds reared at warmer 
temperatures, suggesting less 
blood flow. 

Cold conditions soon after 
hatching may irreversibly 
inhibit blood flow or blood- 
vessel development in beaks. 
Proc. R. Soc. B. 280, 20131436 
(2013) 
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SEVEN DAYS nescnnss 


POLICY 


Carp invasion 

Adding electrified barriers 
and developing high-power 
water guns are part ofa US 
plan announced on 24 July to 
prevent invasive Asian carp 
from reaching the Great Lakes. 
The carp, now abundant in the 
connecting Mississippi River, 
are crowding out fish native to 
the region such as the gizzard 
shad (Dorosoma cepedianum). 
The powerful jumpers have 
also injured humans and 
damaged boats. So far, the 
government has spent more 
than US$200 million to keep 
the carp out of the Great Lakes. 


Publishing block 


A court order is preventing 

the publication of work that 
identified security weaknesses 
in software used to immobilize 
cars, it emerged last week. 
Researchers at the University 
of Birmingham, UK, and 
Radboud University Nijmegen, 
the Netherlands, were due 

to present the findings this 
month at the USENIX Security 
Symposium in Washington 
DC. But they have pulled out 
after Volkswagen's parent 
company obtained an interim 
injunction from the British 
High Court on 25 June. On 

29 July, Radboud said it found 
the ban “incomprehensible” 
but would respect the ruling; 
Birmingham also said it would 
defer publication of the paper. 


Drug import block 
On 23 July, a US appeals 

court ruled that the Food and 
Drug Administration (FDA) 
cannot allow imports of an 
unapproved drug used as an 
anaesthetic for the execution 
of prisoners. The drug, sodium 
thiopental, is no longer 
available from authorized US 
manufacturers, forcing state 
officials to order the drug from 
sources abroad (see Nature 
http://doi.org/ds64d9; 2011). 


Sun-scope opens its eyes 


NASA has revealed the first images from its 
latest solar observatory, the Interface Region 
Imaging Spectrograph (IRIS). The instrument 
was launched into orbit in June (see Nature 
498, 279-280; 2013) to watch the flow of 
matter and energy in the Sun’s chromosphere, 
a 1,700-kilometre-thick region between its 


The FDA had argued that it 

has the discretion to allow the 
import of unapproved drugs, 
noting that it also uses this 
discretion to alleviate shortages 
of other drugs. 


Wetlands suit 


A state board that oversees 
flood-protection efforts in 
Louisiana is suing some 100 
energy companies, alleging 
that pipelines and access 
canals have damaged coastal 
wetlands. The Southeast 
Louisiana Flood Protection 
Authority-East filed the 
lawsuit on 24 July, arguing 
that the wetlands serve as a 
buffer zone that provides vital 
protection against flooding. 
On 23 July, a natural-gas well 
being drilled by the Walter 
Oil & Gas Corporation of 
Houston, Texas, exploded 
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in the Gulf of Mexico off the 
Louisiana coast. 


| __BUSINESS 
Open clinical data 


The pharmaceutical industry 
signalled its willingness last 
week to open vast stores 

of clinical-research data to 
scientists. In a statement 

on 24 July, the European 
Federation of Pharmaceutical 
Industries and Associations 
and the Pharmaceutical 
Research and Manufacturers 
of America — two of the 
world’s leading trade bodies 
— said that they would 
“dramatically increase” the 
amount of information 
released by companies, 
including patient-level data 
and detailed clinical-study 
reports. Some campaigners 
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surface, known as the photosphere, and its outer 
atmosphere, called the corona. It takes much 
higher-resolution pictures than those captured 
by NASA’s Solar Dynamic Observatory 
(pictured, yellow sphere), with features as small 
as 240 kilometres across made visible (pictured, 
black and white). 


for greater openness in clinical 
trials said that the statement 
did not go far enough. See 
go.nature.com/83ww6h for 
more. 


Pharma buyout 
Health-care supplier Perrigo 
will buy Elan, a pharmaceutical 
company based in Dublin, 

for US$8.6 billion, Perrigo 
announced on 29 July. Elan is 
best known for developing the 
multiple-sclerosis drug Tysabri 
(natalizumab), but spun off 

its research and development 
arm last year. It placed itself 

on the market after fending 
offa hostile takeover attempt 
earlier this year. Perrigo, which 
is based in Allegan, Michigan, 
and specializes in selling over- 
the-counter drugs, will receive 
a tax break for relocating its 
base to Ireland. 


NASA/SDO/IRIS 
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SOURCE: PWC/NVCA/THOMSON REUTERS 


Scripps head 


Margaret Leinen will take the 
helm of the Scripps Institution 
of Oceanography at the 
University of California, San 
Diego (UCSD), the university 
announced on 25 July. She 
will replace former director 
Tony Haymet on 1 October. 
Leinen, an oceanographer 
and president-elect of the 
American Geophysical 
Union, ran the geosciences 
directorate at the US National 
Science Foundation for seven 
years, and most recently 

led the Harbor Branch 
Oceanographic Institute in 
Fort Pierce, Florida. She says 
that her priorities as director 
will include strengthening 
academic partnerships 
between Scripps and the rest of 
UCSD in the face of tightening 
budgets. 


Sexologist dies 


Pioneering sex researcher 
Virginia Johnson (pictured) 
died last week aged 88, news 
outlets reported on 25 July. 
Johnson and her long-time 
collaborator William Masters 
were known for their landmark 
studies of the human sexual 
response, including filming 
changes inside the vagina 
during female arousal and 
orgasm. On the basis of 
observations that the vagina 
conforms to the penis, Johnson 
and Masters posited that 


TREND WATCH 


Worldwide investment in the 
US biotechnology industry 
rose in the second quarter of 


this year, according to a report 
from PricewaterhouseCoopers 


and the US National Venture 
Capital Association. Venture 


capitalists pumped US$1.3 billion 


into the sector, an increase of 
68% over the same period last 
year. Meanwhile, investment 


in medical devices declined by 
20%, and overall venture-capital 


investment across industries 


fell by 9% relative to the second 


quarter of 2012. 


penis size should not bea 
prime determinant of female 
satisfaction (see Nature http:// 
doi.org/m92; 2013). 


NASA top scientist 
Planetary geologist Ellen Stofan 
will be NASAs next chief 
scientist, the agency announced 
on 29 July. An honorary 
professor at University College 
London and vice-president 

of Proxemy Research in 
Laytonsville, Maryland, Stofan 
held various senior positions 

at NASA from 1991 to 2000. 
Replacing Waleed Abdalati as 
the agency’s lead adviser on 
science programmes, Stofan 
will have to deal with severe 
budget cuts that have been 
proposed for the planetary 
sciences division. 


EFSA resignation 


The European Food Safety 
Authority (EFSA) announced 
the resignation of its executive 


director on 24 July. Catherine 
Geslain-Lanéelle, who has led 
the agency since 2006, will take 
up anew post on 1 September 
as France's director general 

for agricultural, agri-food 

and territorial policies at 

the Ministry of Agriculture, 
Food and Forestry. Under 

her leadership, the EFSA 
introduced tougher conflict- 
of-interest policies for staff and 
outside experts. In January, 

the agency also announced a 
transparency initiative to make 
the data underlying its risk 
assessments publicly available. 


| RESEARCH 
Big data satellite 


Alphasat, the most massive 
telecommunications satellite 
ever built in Europe, was 
launched on 25 July. Operated 
jointly by the European Space 
Agency and satellite company 
Inmarsat, the 6.6-tonne 
spacecraft will test an 
experimental device to relay 
messages between satellites 
using laser beams, which 
could vastly increase the rate at 
which data are communicated. 
The satellite will primarily 
communicate with Earth 
using radio waves in the 
commonly used L band of the 
electromagnetic spectrum, but 
it also carries an experiment 
for broadcasting in the less- 
crowded, high-frequency 
Q-V band. See go.nature.com/ 
wjelis for more. 


BIOTECH BUCKS DOWNWARD TREND 


Venture-capital funding for US biotechnology firms has remained 
steady, although overall investment is on a gradual decline. 
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The Ecological Society 
of America discusses 
managing sustainable 
ecosystems and data- 
intensive research at 
its annual meeting in 
Minneapolis. 
www.esa.org/minneapolis 


Stem-cell network 
The California Institute 

for Regenerative Medicine 
(CIRM), based in San 
Francisco, approved plans on 
25 July for a US$70 million 
network of clinics to test 
stem-cell therapies. The 
‘Alpha Clinics’ network will 
consist of up to five sites and 
a coordinating centre and 
will focus on therapies that 
involve transplanting or 
infusing stem cells. CIRM 
says that the network could 
help researchers to resolve 
technical and regulatory issues 
involved in early cellular 
therapy trials and educate 
patients who might otherwise 
turn to untested and risky 
treatments. 


Cigarette appeal 


Flavour additives and 
cigarette packaging may 
influence smoking habits, 
according to two reports 
released last week. Ina 
preliminary scientific 
evaluation, the US Food and 
Drug Administration said 
on 23 July that cigarettes 
containing menthol may 
pose a greater public-health 
risk than non-mentholated 
cigarettes, probably by 
increasing the take-up of 
smoking and reducing success 
in quitting. On 22 July, 
researchers in Australia 
reported that plain packaging 
for cigarettes is associated 
with lower smoking appeal 
and a greater urgency to quit 
(M. A. Wakefield et al. BMJ 
Open 3, e003175; 2013). 
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Labs vie for X-ray source 


California facilities respond to US panel’s call for a powerful free-electron laser. 


BY EUGENIE SAMUEL REICH 


he competition to build the world’s most 
| powerful source of X-rays is becoming 
as intense as the light pulses it might 
one day emit, with two US Department of 
Energy (DOE) laboratories vying to host the 
proposed machine. The two California labs — 
the Lawrence Berkeley National Laboratory in 
Berkeley and the SLAC National Accelerator 
Laboratory in Menlo Park — each had hopes of 
getting its own project funded. But on 25 July, 
advisers to the DOE said that building just one, 
cutting-edge light source would be better. 
“Instead of an argument, wed like to have a 
unified proposal,” says William Barletta, a physi- 
cist at the Massachusetts Institute of Technology 
in Cambridge who served on the DOE advisory 
committee, which met last week in Bethesda, 
Maryland, to put forward the light-source 


recommendations. “You want to make a 
revolutionary machine that really stands out” 

X-ray light sources provide an imaging tech- 
nology that serves a wide range of scientists, 
including biologists, chemists and materials 
scientists. The DOE has led the area in recent 
decades, building four powerful synchrotrons, 
which produce intense X-rays as electrons zip 
around circular paths. The department has also 
built a pioneering free-electron laser, which 
uses special undulator magnets to wiggle an 
electron beam so that it emits intense, laser- 
like pulses of X-ray light. A European collabo- 
ration is constructing a free-electron laser in 
Germany, and Sweden, Japan and Brazil are 
working on designs for synchrotrons that 
would compete with the US machines. In Janu- 
ary, the DOE commissioned a review to try to 
sustain US leadership in the field. 

The panel's resulting report throws down the 


© 2013 Macmillan Publishers Limited. All rights reserved 


gauntlet to light-source designers at the two 
competing labs to cooperate on a design for a 
single free-electron laser. In 2011, the Berkeley 
lab won initial approval from the DOE to begin 
work on a free-electron laser called the Next 
Generation Light Source (NGLS), whereas 
SLAC had hoped to upgrade its Linac Coher- 
ent Light Source (LCLS), the free-electron laser 
that it currently operates. 

Managers at the labs had said that there 
was room for two complementary machines. 
The NGLS would produce lower-energy, ‘soft 
X-rays — ideal for imaging biological matter, 
chemical reactions and electron movement in 
the outer shells of atoms. Its particular strength 
would be a high repetition rate for its X-ray 
pulses, which would enable multiple snap- 
shots of electron motion and the creation of 
‘molecular movies’ of chemical processes. By 
contrast, the LCLS upgrade would extend 
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SHINING LIGHTS 


Proposed project Repetition rate 


LCLS upgrade 1 kHz 
GLS 1,000 kHz 
Panel recommendation 100 kHz 


ndividually, proposals for the Linac Coherent Light Source (LCLS) and the Next Generation Light Source 
(NGLS) do not meet the specifications called for by a panel that advises the US Department of Energy. 


Upper energy limit 
25,000 eV 

720 eV 

5,000 eV 


ilohertz (kHz); electronvolts (eV) 


> the machine's capability to produce higher- 
energy, ‘hard’ X-rays, which could image the 
arrangement of atoms and penetrate deep into 
heavier elements. But its intense pulses would 
come at a lower repetition rate, ruling out 
movies of dynamic processes. 

Both approaches would be likely to attract 
users. Lou DiMauro, an atomic physicist at 
Ohio State University in Columbus, is keen to 
run ‘pump-probe’ experiments, in which an 
initial X-ray pulse is used to excite an atom, and 
the next pulse is used to probe the atoms state. 
The closely spaced pulses of the NGLS design 
would be ideal for that. But Phil Bucksbaum, 
an atomic physicist at Stanford University in 
California, who uses the SLAC light source, 
says that the NGLS would not be able to probe 
heavier elements because it operates at too low 
an energy compared with an upgraded LCLS. 

The DOE advisory group found that the 
broadest science case could be met by a single 
facility that combines the strengths of both the 
NGLS and the LCLS (see ‘Shining lights’). The 
committee’s recommendations will help the 
energy department to respond to members of 
the US Congress who have asked for a more 
compelling case for a future free-electron laser. 


Building two, smaller, less-capable machines 
“js not the best science per dollar’, says Barletta. 
In response to the panel’s recommendation, 
the Berkeley lab and SLAC have been scram- 
bling to extend the reach of their proposals, 
and jockeying to be the front-runner to host 
a single site. Paul Alivisatos, director of the 
Berkeley lab, says that the NGLS design had 
an upper energy 


“You want limit of 720 electron- 
tomakea volts (eV) to keep 
revolutionary project costs below 
machine US$700 million. 
that really Increasing the budget 


to $1.2 billion would 
allow the electron- 
beam accelerator to be lengthened and would 
boost the upper energy limit to 3,000 eV, not 
far from the advisory panel's desired level of 
5,000 eV. “It’s a straightforward extension of 
our proposal,” he says. 

Uwe Bergmann, associate director of the 
LCLS, says that an upgrade to his machine could 
get it to a repetition rate of 10 kilohertz (kHz), 
but the current proposal boosts it to only 
1 kHz. To get near the panel's recommendation 
of 100 kHz, he acknowledges that his facility 


stands out.” 


would need to replace its accelerator with a 
superconducting one — a key feature of the 
Berkeley lab’s proposal. But ultimately, he says, 
the idea of upgrading an existing machine may 
be more realistic in a cost-constrained environ- 
ment than the advisory panel’s ambitious vision. 
“A committee suggests something — but the 
committee doesn't foot the bill” 

The tight fiscal climate has exacerbated com- 
petition between the two proposals, says accel- 
erator physicist Michael Borland of Argonne 
National Laboratory in Illinois. “There is limited 
funding, and government agencies need to 
decide which machine to build first,” he says. 
But he sees at least one way to combine the two 
projects: the electron source and superconduct- 
ing linear accelerator from the NGLS proposal 
could be put in the existing LCLS tunnel to take 
advantage of its undulator magnets. “This seems 
to make more sense than starting from scratch 
with a higher-energy NGLS” 

For users, a new plan cannot come too soon, 
says Thomas Russell, a polymer scientist at the 
University of Massachusetts Amherst. Russell 
wants to use a fast-repeating X-ray source 
to watch the crystallization of photoactive 
materials used in solar cells. The current LCLS 
is not fast enough to make the movies he wants, 
and moreover, as the premier free-electron laser 
in the United States, the LCLS turns away four 
scientists for every one that is granted time. He 
has visited all four of the US X-ray synchrotrons, 
but the diffuse nature of their light would make 
it impossible for him to understand his crystal 
structures. “You reach a certain limit and you 
just can't do the experiment you want to do,’ 
he says. “The light sources that exist just don't 
provide enough oomph.” = 


BIOMEDICINE 


NIH mulls rules for 
validating Key results 


US biomedical agency could enlist independent labs for verification. 


BY MEREDITH WADMAN 


apparently reproducible: a steady stream of 
studies that show the irreproducibility of 
many important experiments. 

In a 2011 internal survey, pharmaceuti- 
cal firm Bayer HealthCare of Leverkusen, 
Germany, was unable to validate the relevant 
preclinical research for almost two-thirds of 
67 in-house projects. Then, in 2012, scientists 
at Amgen, a drug company based in Thousand 


E biomedical science, at least one thing is 
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Oaks, California, reported their failure to 
replicate 89% of the findings from 53 land- 
mark cancer papers. And in a study published 
in May, more than half of the respondents to a 
survey at the MD Anderson Cancer Center in 
Houston, Texas, reported failing at least once 
in attempts at reproducing published data (see 
“Make believe’). 

The growing problem is threatening the 
reputation of the US National Institutes of 
Health (NIH) based in Bethesda, Maryland, 
which funds many of the studies in question. 
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Senior NIH officials are now considering 
adding requirements to grant applications to 
make experimental validations routine for cer- 
tain types of science, such as the foundational 
work that leads to costly clinical trials. As the 
NIH pursues such top-down changes, one 
company is taking a bot- 
tom-up approach, target- 
ing scientists directly to 
see if they are willing to 
verify their experiments. 

There is the looming 


NATURE.COM 
For more on the 
challenges of 
reproducibility: 
go.nature.com/zqtrnp 
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(NGLS) do not meet the specifications called for by a panel that advises the US Department of Energy. 
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energy, ‘hard’ X-rays, which could image the 
arrangement of atoms and penetrate deep into 
heavier elements. But its intense pulses would 
come at a lower repetition rate, ruling out 
movies of dynamic processes. 

Both approaches would be likely to attract 
users. Lou DiMauro, an atomic physicist at 
Ohio State University in Columbus, is keen to 
run ‘pump-probe’ experiments, in which an 
initial X-ray pulse is used to excite an atom, and 
the next pulse is used to probe the atoms state. 
The closely spaced pulses of the NGLS design 
would be ideal for that. But Phil Bucksbaum, 
an atomic physicist at Stanford University in 
California, who uses the SLAC light source, 
says that the NGLS would not be able to probe 
heavier elements because it operates at too low 
an energy compared with an upgraded LCLS. 

The DOE advisory group found that the 
broadest science case could be met by a single 
facility that combines the strengths of both the 
NGLS and the LCLS (see ‘Shining lights’). The 
committee’s recommendations will help the 
energy department to respond to members of 
the US Congress who have asked for a more 
compelling case for a future free-electron laser. 
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get it to a repetition rate of 10 kilohertz (kHz), 
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would need to replace its accelerator with a 
superconducting one — a key feature of the 
Berkeley lab’s proposal. But ultimately, he says, 
the idea of upgrading an existing machine may 
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“A committee suggests something — but the 
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The tight fiscal climate has exacerbated com- 
petition between the two proposals, says accel- 
erator physicist Michael Borland of Argonne 
National Laboratory in Illinois. “There is limited 
funding, and government agencies need to 
decide which machine to build first,” he says. 
But he sees at least one way to combine the two 
projects: the electron source and superconduct- 
ing linear accelerator from the NGLS proposal 
could be put in the existing LCLS tunnel to take 
advantage of its undulator magnets. “This seems 
to make more sense than starting from scratch 
with a higher-energy NGLS” 

For users, a new plan cannot come too soon, 
says Thomas Russell, a polymer scientist at the 
University of Massachusetts Amherst. Russell 
wants to use a fast-repeating X-ray source 
to watch the crystallization of photoactive 
materials used in solar cells. The current LCLS 
is not fast enough to make the movies he wants, 
and moreover, as the premier free-electron laser 
in the United States, the LCLS turns away four 
scientists for every one that is granted time. He 
has visited all four of the US X-ray synchrotrons, 
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many important experiments. 
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preclinical research for almost two-thirds of 
67 in-house projects. Then, in 2012, scientists 
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Oaks, California, reported their failure to 
replicate 89% of the findings from 53 land- 
mark cancer papers. And in a study published 
in May, more than half of the respondents to a 
survey at the MD Anderson Cancer Center in 
Houston, Texas, reported failing at least once 
in attempts at reproducing published data (see 
“Make believe’). 

The growing problem is threatening the 
reputation of the US National Institutes of 
Health (NIH) based in Bethesda, Maryland, 
which funds many of the studies in question. 
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Senior NIH officials are now considering 
adding requirements to grant applications to 
make experimental validations routine for cer- 
tain types of science, such as the foundational 
work that leads to costly clinical trials. As the 
NIH pursues such top-down changes, one 
company is taking a bot- 
tom-up approach, target- 
ing scientists directly to 
see if they are willing to 
verify their experiments. 

There is the looming 
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For more on the 
challenges of 
reproducibility: 
go.nature.com/zqtrnp 


SCIENCE EXCHANGE 


Reproducibility problems have led the 
National Institutes of Health to consider 
verification rules for some experiments. 


NR Bayer HealthCare 


finds inconsistencies between in-house 
and published data in almost two-thirds 
of 67 projects. 


Amgen publication 
shows that the findings from only 
11% of 53 landmark papers can be 
replicated by company scientists. 


SU AAU Elizabeth lorns 
(pictured), chief executive of Science 
Exchange, asks more than 22,000 
study authors if they want her 
company to arrange verification of their 
experiments; nearly 2,000 say yes. 


EVGA EI A survey at the MD 


Anderson Cancer Center finds that more 
than half of its respondents have tried 
and failed to reproduce published data. 


SLU er RE Science Exchange 


launches a verification programme for 
commercially sold antibodies. 


question, however, of who will pay for it all. 
Independently validating the results ofa major 
paper that has in vitro and animal experiments 
can cost US$25,000, says Elizabeth Iorns, chief 
executive of Science Exchange, a company in 
Palo Alto, California, that matches scientists 
with verification service providers. 

Last year, the NIH convened two workshops 
that examined the issue of reproducibility, and 
last October, the agency’s leaders and others 
published a call for higher standards in the 
reporting of animal studies in grant applica- 
tions and journal publications. Ata minimum, 
they wrote, studies should report on whether 
and how animals were randomized, whether 
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investigators were blind to the treatment, how 
sample sizes were estimated and how data were 
handled. 

The NIH is just beginning to take active 
measures, says Lawrence Tabak, the agency’s 
principal deputy director. “There is certainly 
sufficient information now that the NIH feels 
it's appropriate to look at this at a central- 
agency level,” he says. This summer, he and 
other senior NIH officials, including Story 
Landis, director at the neurology institute, and 
Harold Varmus, director at the cancer institute, 
are assessing input gathered from the directors 
of the agency’s 27 institutes and centres. They 
will then confer with NIH director Francis 
Collins, who will decide what steps to take. 

Proposals under consideration include 
modifying peer review to bring greater scru- 
tiny to the work a grant application is based on 
— perhaps just for applications that are likely 
to lead to clinical trials. In a June meeting of 
Collins's advisory committee, Tabak imag- 
ined implementing such a scenario. “If the 
premise isn’t validatable, then we're done; it 
doesn't matter how well you wrote the grant,” 
he said. Agency officials are also considering 
a requirement that independent labs validate 
the results of important preclinical studies as a 
condition of receiving grant funding. 

The very idea of a validation requirement 
makes some scientists queasy. “It’s a disas- 
ter,’ says Peter Sorger, a systems biologist at 
Harvard Medical School in Boston, Massachu- 
setts. He says that frontier science often relies 
on ideas, tools and protocols that do not exist 
in run-of-the-mill labs, let alone in companies 
that have been contracted to perform verifica- 
tion. “It is unbelievably difficult to reproduce 
cutting-edge science,” he says. 

But others say that independent validation 
is a must to counteract the pressure to pub- 
lish positive results and the lack of incentives 
to publish negative ones. Iorns doubts that 
tougher reporting requirements will make 
any real impact, and thinks that it would be 
better to have regular validations of results, 
either through random audits or selecting the 
highest-profile papers. 

Science Exchange would clearly benefit from 
anew flow of business should the NIH impose 
such a mandate on even some studies. The com- 
pany’s Reproducibility Initiative, launched last 
year, arranges for the independent replication of 
study results for authors who request the service 
— and the first batch of academic validations 
began in May. In January, Iorns asked more 
than 22,000 corresponding authors of original 
biomedical papers published in 2012 if they 
would allow the experiments in their reports to 
be independently verified, should funding be 
made available. Of those who responded, 1,892 
scientists said yes and 416 declined. 

Iorns says that there are plenty of important 
papers among those published by the 1,892 
— they had a readership that was, on average, 
an order of magnitude higher than that of > 
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> the papers whose authors declined, as 
measured by downloads to a free online ref- 
erence manager, Mendeley. The Laura and 
John Arnold Foundation, based in Houston, 
Texas, says that it is actively considering 
funding Science Exchange to validate can- 
cer-cell biology papers within Iorns’ cohort. 

Some at the NIH are coming round to 
the idea that validation is best contracted 
out. Shai Silberberg, who is responsible 
for reproducibility issues at the agency’s 
neurology institute, has almost finished a 
pilot study in which several academic labs 
tried to reproduce findings from studies 
aiming to move drugs to a stage at which 
they are ready to be tested in humans. 
He points out that it has already taken two 
and a half years. “It’s too slow,” he says. He 
now favours speedier contract-research 
organizations. 

Iorns, for her part, is not waiting for the 
NIH to take action. On 30 July, Science 
Exchange launched a programme with 
reagent supplier antibodies-online.com, 
based in Aachen, Germany, to indepen- 
dently validate research antibodies. These 
are used, for example, to probe gene func- 
tion in biomedical experiments, but their 
effects are notoriously variable. “Having a 
third party validate every batch would bea 
fabulous thing,” says Peter Park, a computa- 
tional biologist at Harvard Medical School. 
He notes that the consortium behind 
ENCODE — a project aimed at identifying 
all the functional elements in the human 
genome — tested more than 200 antibodies 
targeting modifications to proteins called 
histones and found that more than 25% 
failed to target the advertised modification. 

With antibodies, the companies that 
make them have an incentive to prove the 
quality of their products, and Iorns hopes 
that they will pay the thousands of dollars 
that such validation costs. Antibodies that 
pass muster will receive an ‘independently 
validated’ green tick in the antibodies- 
online.com catalogue. 

But with budgets stretched thin — and 
with Congress well aware of the reproduc- 
ibility issues — the NIH also has an incen- 
tive to make sure that its $29-billion budget 
is spent on verifiable science. “We are obli- 
gated to consider how we want to address 
this,” says Tabak. = 
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The tongue is one of many sites to have its bacteria catalogued by the Human Microbiome Project. 
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Microbiome research 
goes without a home 


Scientists say core tools and expertise remain necessary. 


BY BETH MOLE 


r | Arillions of microorganisms call the 
human body home. But ‘home’ for 
many US scientists studying these 

microscopic squatters is about to change, as 

funding for human microbiome research 
scatters across 16 of the 27 centres of the US 

National Institutes of Health (NIH). 

Last year, researchers completed the 
$US173-million Human Microbiome Project, 
which took five years and generated a slew of 
reference data, mostly genetic sequences ofall 
the microbes that dwell on and inside humans. 
But the project's scientists fear that a lack of 
standards and expertise in data-gathering and 
analysis are hampering efforts to extract mean- 
ing from this information. 

At a meeting last week in Bethesda, Mary- 
land, they reiterated that identifying the 


microbes is just the first step. Researchers 
must also focus on how bacteria interact with 
each other and the human body to cause — or 
prevent — disease. Yet these calls for action are 
coming as the project faces significant down- 
sizing: by the beginning of 2014, microbiome 
researchers will no longer be able to depend 
on centralized resources based at the NIH’s 
National Human Genome Research Institute 
(NHGRI) in Bethesda. 

“The microbiome has so much appeal,” says 
Christian Jobin, an immunologist at the Uni- 
versity of Florida in Gainesville, who studies 
the interplay between gut microbes, inflam- 
mation and cancer. “But we're lacking direction 
right now” 

In 2012, the project culminated with a flurry 
of publications (D. A. Relman Nature 486, 194- 
195; 2012). But, says Lita Proctor, the project’s 
programme director, some efforts still seem too 
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Scientists say core tools and expertise remain necessary. 


BY BETH MOLE 


r | Arillions of microorganisms call the 
human body home. But ‘home’ for 
many US scientists studying these 

microscopic squatters is about to change, as 

funding for human microbiome research 
scatters across 16 of the 27 centres of the US 

National Institutes of Health (NIH). 

Last year, researchers completed the 
$US173-million Human Microbiome Project, 
which took five years and generated a slew of 
reference data, mostly genetic sequences ofall 
the microbes that dwell on and inside humans. 
But the project's scientists fear that a lack of 
standards and expertise in data-gathering and 
analysis are hampering efforts to extract mean- 
ing from this information. 

At a meeting last week in Bethesda, Mary- 
land, they reiterated that identifying the 


microbes is just the first step. Researchers 
must also focus on how bacteria interact with 
each other and the human body to cause — or 
prevent — disease. Yet these calls for action are 
coming as the project faces significant down- 
sizing: by the beginning of 2014, microbiome 
researchers will no longer be able to depend 
on centralized resources based at the NIH’s 
National Human Genome Research Institute 
(NHGRI) in Bethesda. 

“The microbiome has so much appeal,” says 
Christian Jobin, an immunologist at the Uni- 
versity of Florida in Gainesville, who studies 
the interplay between gut microbes, inflam- 
mation and cancer. “But we're lacking direction 
right now” 

In 2012, the project culminated with a flurry 
of publications (D. A. Relman Nature 486, 194- 
195; 2012). But, says Lita Proctor, the project’s 
programme director, some efforts still seem too 
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focused on data-gathering and do not account 
for the complex ways in which microbes 
interact with one another and their hosts. Some 
scientists are “struggling to figure out how to 
think about the microbiome’, she says. 

There are several problems, says Rob 
Knight, a microbial ecologist at the University 
of Colorado Boulder. First, scientists need to 
forge standards on matters such as how long to 
wait after a person has a shower before swab- 
bing their skin for microbes, or what a person 
ate before collecting a stool sample. Although 
researchers who gathered the reference data 
sets from healthy individuals tried to estab- 
lish such standards, some of these have been 
ignored, and there is also wide variation in the 
way that microbiome samples are amassed 
from people with diseases. 

Second, collecting, sequencing and analys- 
ing DNA from thousands of microbial species 
living in and on humans requires an interdisci- 
plinary team with knowledge of clinical ethics, 
engineering and bioinformatics — expertise 
that can be tricky to assemble. 

Last, it remains difficult to establish 
whether a microbial trend associated with 
a disease is the cause or a result of that dis- 
ease. One solution, Knight says, would be to 
track people over time, to allow researchers 
to detect microbial changes that occur before 
or after someone becomes ill. Soon, he hopes, 


microbiome information could help doctors 
to predict a patient’s risk of developing various 
diseases, and conditions such as obesity. 
Jobin says few microbiologists are trying to 
understand what the microbes are doing and 
how they might be controlled. This year, he 
sat on a review panel 


“Scientists are for grant applications 
strugs ling to proposing to explore 
figure outhow to the microbiome’s 
think about the 


connection to gas- 
trointestinal cancers, 
and noted that many 
planned simply to sequence microbial com- 
munities. They were swiftly rejected. “There are 
very few mechanism-driven studies,’ he says. 

Expectations are higher now than they were 
in 2003, when the Human Genome Project 
wrapped up, says Owen White, a bioinformat- 
ics researcher at the University of Maryland 
School of Medicine in Baltimore. “When you 
were sequencing the human genome, or the 
next mammalian genome, everyone knew that 
that was relatively hypothesis-free — and that 
was fine,’ he says. 

The Human Genome Project was overseen by 
the NHGRI, which continues to lead genome- 
related work. By contrast, the Human Microbi- 
ome Project does not have a place to call home 
for its second phase. Over its first five years, it 
received $146 million from the NIH common 


microbiome.” 
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fund, money that was managed by the NHGRI 
and that contributed to the development of 
field-wide tools. But in its second phase, from 
2014 to 2016, only $15 million in common-fund 
money will flow through the NHGRI. Micro- 
biome research will instead be largely supported 
by 16 individual NIH institutes. 

The project’s leaders say that the effort still 
needs a base to provide resources such as stand- 
ardized sampling protocols, technical support, 
and microbiome samples and data from spe- 
cific patient groups that researchers can mine. 
Many had assumed that the NHGRI would take 
on this role, because the microbiome project 
was initially seen as an extension of the Human 
Genome Project. But the NHGRI “has been less 
enthusiastic than expected’, Knight says. Jane 
Peterson, a senior adviser to the NHGRI direc- 
tor’s office, says that the future of human micro- 
biome research will be in clinical applications, 
which does not fit with the NHGRI’s mission. 

But Heidi Kong, a dermatologist and micro- 
biome researcher at the National Cancer Insti- 
tute in Bethesda, says that the project’s basic 
science needs to be nurtured further before 
it will be ready for the clinic. Scientists need 
first to pin down the function of individual 
microbes and the body’s response to them, and 
should only then begin testing treatments on 
those interactions. “There is a bit more work 
we need to do,” Kong says. m 
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Pilot projects bury carbon 
dioxide in basalt 


Two experiments test viability of sequestering emissions in porous layers of hard rock. 


BY JEFF TOLLEFSON 


y early August, scientists will have 
Ber 1,000 tonnes of pure carbon 

dioxide into porous rock far below the 
northwestern United States. The goal is to find 
a permanent home for the carbon dioxide 
generated by human activities. 

Researchers at the US Department of 
Energy's Pacific Northwest National Labora- 
tory (PNNL) in Richland, Washington, began 
the injections into the Columbia River Basalt 
formation near the town of Wallula on 17 July. 
The rock contains pores created as many as 
16 million years ago, when magma flowed 
across what is now the Columbia River Basin. 
Bubbles of CO, migrated to the edges of the 
magma as it cooled, forming layers of holes 
sandwiched between solid rock (see ‘Rock 
steady’). 

In pumping emissions back underground, 
“we are returning the carbon dioxide from 
whence it came”, says Pete McGrail, an 
environmental engineer at the PNNL who 
is heading the experiment, part of a larger 
energy-department programme on ways to 
sequester carbon. 

The Wallula project is the second of two 
worldwide to target basalt formations, which 
scientists hope can hold — and permanently 
mineralize — vast quantities of gas. In basalt, 
dissolved CO, should react with calcium and 
magnesium to form limestone over the course 
of decades. Until the gas is locked away, the 
porous basalt layers are capped by solid rock 
that will prevent leaking. That should elimi- 
nate concerns about 


leakage that have “Weare 
dogged other pro- returning the 
posals to store CO, carbon dioxide 
deep underground, from whenceit 
often in sandstone came.” 


reservoirs. 

The basalt reactions are part of a natural 
weathering process that has helped to regulate 
atmospheric CO, levels throughout geological 
time. Scientists have analysed mineralization 
in the lab, but it is only now being tested in 
the field. 

Researchers working on the other basalt 
project, based in Iceland and run by a con- 
sortium of US and European scientists along 
with Reykjavik Energy, made their first CO, 
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ROCK STEADY 


A carbon dioxide sequestration project in Washington 
state is testing whether chemical reactions in porous 
basalt will help to trap gas injected into the rock. 


CO, injection well 
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will hold gas 
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Total well depth 
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injections last year and will conduct another 
round this year. Early results look promising, 
says Juerg Matter, a geochemist at Columbia 
University’s Lamont-Doherty Earth Observa- 
tory in Palisades, New York, who is working 
on the Iceland project. “The mineralization 
reaction is most likely faster than what we 
in the community had thought,” says Mat- 
ter, who has also contributed to the Wallula 
project. Assuming that holds true for basalt 
generally, “you reduce the risk of leakage, and 
you can pretty much walk away from your 
storage reservoirs’. 

In Wallula, researchers are already moni- 
toring a series of shallow wells around the 
injection site for signs of CO, leaking into the 
soil and groundwater. Once the injection is 
finished, they will start taking samples from 
the injection well to monitor water chemistry, 
track changes in carbon isotopes and check 
for other evidence of reactions. Lab tests and 
computer simulations suggest that in general, 
around 20% of the CO, should be mineralized 
within 10-15 years, says McGrail. 

The pilot project, however, is operating ona 
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shorter timescale. Fourteen months after the 
end of injection, the team plans to drill another 
well and pull up a core of rock to assess the 
results, says McGrail. “At that point, we are 
hoping to have some carbonized rock in our 
hands.” 

But achieving sequestration is only half the 
battle: scientists and engineers must still work 
out how to capture CO, from industrial facili- 
ties and transport it to the sequestration site 
cost-effectively. And even if a carbon-miner- 
alization industry took off, establishing it on a 
global level would require an undertaking on 
the scale of rebuilding the oil industry. 

Scientific opinions differ over whether it 
would be more desirable to stop burning fossil 
fuels than to undertake massive carbon-seques- 
tration ventures — but if sequestration were to 
be favoured, many think that basalt could be 
important. And although backers of large-scale 
basalt sequestration have so far explored for- 
mations in the US northwest and southeast as 
well as in India, many are also looking offshore, 
where the sea floor could accommodate CO, 
emissions for centuries to come. 

Carbon-sequestration research has until 
now tended to focus on sandstone reservoirs 
rather than basalt. There are two main rea- 
sons, says David Goldberg, a marine geologist 
at Lamont-Doherty. The oil industry is used 
to working with sandstone, and such forma- 
tions are relatively common — making it easier 
to transport CO, from a power plant or other 
source to a sequestration site. That might mean 
that sandstone is more economically viable 
than ocean basalt, at least in the short term. 
But Goldberg says that the best place to bury 
globally significant volumes of CO, is offshore, 
where they will be safely capped by sediments 
and sea water. 

A single formation off the US west coast, 
with an estimated storage volume of 685 cubic 
kilometres, has the potential to hold all the 
CO, emissions the country produces in a cen- 
tury, Goldberg notes. “If we can make it work,” 
he says, “the oceans have a lot of advantages.” 

But none of this will be cheap, says Kevin 
Johnson, a geochemist at the University of 
Hawaii in Honolulu who has worked on lab 
experiments with McGrail’s team. “It’s a ques- 
tion of social importance — and whether the 
climate situation gets dire enough to justify the 
cost.” m 
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The milk revolution 


When a single genetic mutation first let ancient Europeans 
drink milk, it set the stage for a continental upheaval. 


was excavating a Stone Age site in the fer- 

tile plains of central Poland when he came 
across an assortment of odd artefacts. The 
people who had lived there around 7,000 years 
ago were among central Europe's first farmers, 
and they had left behind fragments of pottery 
dotted with tiny holes. It looked as though the 
coarse red clay had been baked while pierced 
with pieces of straw. 

Looking back through the archaeologi- 
cal literature, Bogucki found other examples 
of ancient perforated pottery. “They were so 
unusual — people would almost always include 
them in publications,’ says Bogucki, now at 
Princeton University in New Jersey. He had seen 
something similar at a friend’s house that was 
used for straining cheese, so he speculated that 
the pottery might be connected with cheese- 
making. But he had no way to test his idea. 

The mystery potsherds sat in storage until 
2011, when Mélanie Roffet-Salque pulled 
them out and analysed fatty residues preserved 
in the clay. Roffet-Salque, a geochemist at the 
University of Bristol, UK, found signatures of 
abundant milk fats — evidence that the early 


[: the 1970s, archaeologist Peter Bogucki 
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BY ANDREW CURRY 


farmers had used the pottery as sieves to sepa- 
rate fatty milk solids from liquid whey. That 
makes the Polish relics the oldest known evi- 
dence of cheese-making in the world’. 
Roffet-Salque’s sleuthing is part of a wave of 
discoveries about the history of milk in Europe. 
Many of them have come from a €3.3-million 
(US$4.4-million) project that started in 2009 
and has involved archaeologists, chemists and 
geneticists. The findings from this group illumi- 
nate the profound ways that dairy products have 
shaped human settlement on the continent. 
During the most recent ice age, milk was 
essentially a toxin to adults because — unlike 
children — they could not produce the lactase 
enzyme required to break down lactose, the 
main sugar in milk. But as farming started to 
replace hunting and gathering in the Middle 
East around 11,000 years ago, cattle herders 
learned how to reduce lactose in dairy prod- 
ucts to tolerable levels 
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that gave people the ability to produce lactase 
— and drink milk — throughout their lives. 
That adaptation opened up a rich new source 
of nutrition that could have sustained commu- 
nities when harvests failed. 

This two-step milk revolution may have 
been a prime factor in allowing bands of 
farmers and herders from the south to sweep 
through Europe and displace the hunter-gath- 
erer cultures that had lived there for millen- 
nia. “They spread really rapidly into northern 
Europe from an archaeological point of view,’ 
says Mark Thomas, a population geneticist at 
University College London. That wave of emi- 
gration left an enduring imprint on Europe, 
where, unlike in many regions of the world, 
most people can now tolerate milk. “It could 
be that a large proportion of Europeans are 
descended from the first lactase-persistent 
dairy farmers in Europe,’ says Thomas. 


STRONG STOMACHS 

Young children almost universally produce 
lactase and can digest the lactose in their 
mother’s milk. But as they mature, most switch 
off the lactase gene. Only 35% of the human 
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population can digest lactose beyond the age of 
about seven or eight (ref. 2). “If you're lactose 
intolerant and you drink half a pint of milk, 
youre going to be really ill. Explosive diarrhoea 
— dysentery essentially,’ says Oliver Craig, an 
archaeologist at the University of York, UK. “I'm 
not saying it’s lethal, but it’s quite unpleasant.” 

Most people who retain the ability to digest 
milk can trace their ancestry to Europe, where 
the trait seems to be linked to a single nucleo- 
tide in which the DNA base cytosine changed 
to thymine in a genomic region not far from 
the lactase gene. There are other pockets of 
lactase persistence in West Africa (see Nature 
444, 994-996; 2006), the Middle East and 
south Asia that seem to be linked to separate 
mutations’ (see ‘Lactase hotspots’). 

The single-nucleotide switch in Europe 
happened relatively recently. Thomas and his 
colleagues estimated the timing by looking at 
genetic variations in modern populations and 
running computer simulations of how the 
related genetic mutation might have spread 
through ancient populations’. They proposed 
that the trait of lactase persistence, dubbed the 
LP allele, emerged about 7,500 years ago in the 
broad, fertile plains of Hungary. 


POWERFUL GENE 

Once the LP allele appeared, it offered a major 
selective advantage. In a 2004 study’, researchers 
estimated that people with the mutation would 
have produced up to 19% more fertile offspring 
than those who lacked it. The researchers called 
that degree of selection “among the strongest yet 
seen for any gene in the genome”. 

Compounded over several hundred gen- 
erations, that advantage could help a popula- 
tion to take over a continent. But only if “the 
population has a supply of fresh milk and is 
dairying”, says Thomas. “It’s gene-culture co- 
evolution. They feed off of each other” 

To investigate the history of that interaction, 
Thomas teamed up with Joachim Burger, a 
palaeogeneticist at the Johannes Gutenberg 
University of Mainz in Germany, and Matthew 
Collins, a bioarchaeologist at the University 
of York. They organized a multidisciplinary 
project called LeCHE (Lactase Persistence 
in the early Cultural History of Europe), 
which brought together a dozen early-career 
researchers from around Europe. 

By studying human molecular biology and 
the archaeology and chemistry of ancient 
pottery, LeCHE participants also hoped to 
address a key issue about the origins of mod- 
ern Europeans. “It’s been an enduring question 
in archaeology — whether we're descended 
from Middle Eastern farmers or indigenous 
hunter-gatherers,” says Thomas. The argument 
boils down to evolution versus replacement. 
Did native populations of hunter-gatherers in 
Europe take up farming and herding? Or was 
there an influx of agricultural colonists who 
outcompeted the locals, thanks to a combina- 
tion of genes and technology? 


One strand of evidence came from studies of 
animal bones found at archaeological sites. If 
cattle are raised primarily for dairying, calves 
are generally slaughtered before their first birth- 
day so that their mothers can be milked. But cat- 
tle raised mainly for meat are killed later, when 
they have reached their full size. (The pattern, 
if not the ages, is similar for sheep and goats, 
which were part of the dairying revolution.) 

On the basis of studies of growth patterns in 
bones, LeCHE participant Jean-Denis Vigne, 
an archaeozoologist at the French National 
Museum of Natural History in Paris, suggests 


DAIRY DIASPORA 


Dairying practices spread from the Middle East to 
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that dairying in the Middle East may go all the 
way back to when humans first started domes- 
ticating animals there, about 10,500 years ago®. 
That would place it just after the Middle East- 
ern Neolithic transition — when an economy 
based on hunter-gathering gave way to one 
devoted to agriculture. Dairying, says Roz 
Gillis, also an archaeozoologist at the Paris 
museum, “may have been one of the reasons 
why human populations began trapping and 
keeping ruminants such as cattle, sheep and 
goats” (See ‘Dairy diaspora’) 

Dairying then expanded in concert with 


Europe as part of the Neolithic transition from hunting 


and gathering to agriculture. 


6,500 YEARS AGO 
Well-developed dairy 
economy established in 
central Europe. 


LACTASE HOTSPOTS 


Piece of a roughly 
7,000-year-old 
sieve used to 
make cheese. 


7,500 YEARS AGO 
Lactase persistence, the ability to drink milk 
in adulthood, emerges in central Europe. 


8,000 YEARS AGO 


Neolithic reaches the Balkans. 


8,400 YEARS AGO 


Neolithic spreads to Greece. 
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11,000-10,000 YEARS AGO 

Neolithic culture develops in the Middle 
East. This is the start of agriculture and 
possibly the domestication of dairy animals. 


Only one-third of people produce the lactase enzyme 
during adulthood, which enables them to drink milk. 
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Percentage of adult population 
that can drink milk 
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the Neolithic transition, says Gillis, who has 
looked at bone growth at 150 sites in Europe 
and Anatolia (modern Turkey). As agriculture 
spread from Anatolia to northern Europe over 
roughly two millennia, dairying followed a 
similar pattern. 

On their own, the growth patterns do not 
say whether the Neolithic transition in Europe 
happened through evolution or replacement, 
but cattle bones offer important clues. In a pre- 
cursor study’, Burger and several other LECHE 
participants found that domesticated cattle at 
Neolithic sites in Europe were most closely 
related to cows from the Middle East, rather 
than indigenous wild aurochs. This is a strong 
indication that incoming herders brought their 
cattle with them, rather than domesticating 
locally, says Burger. 
A similar story is 
emerging from stud- 
ies of ancient human 
DNA recovered at a 
few sites in central 
Europe, which sug- 
gest that Neolithic 
farmers were not descended from the hunter- 
gatherers who lived there before’. 

Taken together, the data help to resolve the 
origins of the first European farmers. “For 
a long time, the mainstream of continental 
European archaeology said Mesolithic hunter- 
gatherers developed into Neolithic farmers,” 
says Burger. “We basically showed they were 
completely different.” 


MILK OR MEAT 

Given that dairying in the Middle East 
started thousands of years before the LP allele 
emerged in Europe, ancient herders must have 
found ways to reduce lactose concentrations in 
milk. It seems likely that they did so by making 
cheese or yogurt. (Fermented cheeses such as 
feta and cheddar have a small fraction of the 
lactose found in fresh milk; aged hard cheeses 
similar to Parmesan have hardly any.) 

To test that theory, LeCHE researchers ran 
chemical tests on ancient pottery. The coarse, 
porous clay contains enough residues for chem- 
ists to distinguish what type of fat was absorbed 
during the cooking process: whether it was 
from meat or milk, and from ruminants such 
as cows, sheep and goats or from other animals. 
“That gave us a way into saying what types of 
things were being cooked,” says Richard Ever- 
shed, a chemist at the University of Bristol. 

Evershed and his LeCHE collaborators 
found milk fat on pottery in the Middle 
Eastern Fertile Crescent going back at least 
8,500 years”, and Roffet-Salque’s work on the 
Polish pottery’ offers clear evidence that herd- 
ers in Europe were producing cheese to supple- 
ment their diets between 6,800 and 7,400 years 
ago. By then, dairy had become a component 
of the Neolithic diet, but it was not yet a domi- 
nant part of the economy. 

That next step happened slowly, and it seems 
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to have required the spread of lactase persis- 
tence. The LP allele did not become common 
in the population until some time after it first 
emerged: Burger has looked for the mutation 
in samples of ancient human DNA and has 
found it only as far back as 6,500 years ago in 
northern Germany. 

Models created by LeCHE participant 
Pascale Gerbault, a population geneticist at 
University College London, explain how the 
trait might have spread. As Middle Eastern 
Neolithic cultures moved into Europe, their 
farming and herding technologies helped them 
to out-compete the local hunter-gatherers. And 
as the southerners pushed north, says Gerbault, 
the LP allele ‘surfed’ the wave of migration. 

Lactase persistence had a harder time 


“Tt’s been an enduring question in archaeology — 
whether we’ re descended from Middle Eastern 
farmers or indigenous hunter-gatherers.” 


becoming established in parts of southern 
Europe, because Neolithic farmers had settled 
there before the mutation appeared. But as 
the agricultural society expanded northwards 
and westwards into new territory, the advan- 
tage provided by lactase persistence had a big 
impact. “As the population grows quickly at 
the edge of the wave, the allele can increase in 
frequency,’ says Gerbault. 

The remnants of that pattern are still visible 
today. In southern Europe, lactase persistence 
is relatively rare — less than 40% in Greece and 
Turkey. In Britain and Scandinavia, by con- 
trast, more than 90% of adults can digest milk. 


CATTLE CONQUEST 

By the late Neolithic and early Bronze Age, 
around 5,000 years ago, the LP allele was preva- 
lent across most of northern and central Europe, 
and cattle herding had become a dominant part 
of the culture. “They discover this way of life, 
and once they can really get the nutritional ben- 
efits they increase or intensify herding as well?” 
says Burger. Cattle bones represent more than 
two-thirds of the animal bones in many late 
Neolithic and early Bronze Age archaeological 
sites in central and northern Europe. 

The LeCHE researchers are still puzzling out 
exactly why the ability to consume milk offered 
such an advantage in these regions. Thomas 
suggests that, as people moved north, milk 
would have been a hedge against famine. Dairy 
products — which could be stored for longer in 
colder climes — provided rich sources of calo- 
ries that were independent of growing seasons 
or bad harvests. 

Others think that milk may have helped, 
particularly in the north, because of its rela- 
tively high concentration of vitamin D, a 
nutrient that can help to ward off diseases 
such as rickets. Humans synthesize vitamin D 
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naturally only when exposed to the sun, which 
makes it difficult for northerners to make 
enough during winter months. But lactase per- 
sistence also took root in sunny Spain, casting 
vitamin D’s role into doubt. 

The LeCHE project may offer a model for 
how archaeological questions can be answered 
using a variety of disciplines and tools. “They 
have got a lot of different tentacles — archae- 
ology, palaeoanthropology, ancient DNA and 
modern DNA, chemical analysis — all focused 
on one single question,” says Ian Barnes, a 
palaeogeneticist at Royal Holloway, University 
of London, who is not involved in the project. 
“There are lots of other dietary changes which 
could be studied in this way.” 

The approach could, for example, help to 
tease apart the ori- 
gins of amylase, an 
enzyme that helps to 
break down starch. 
Researchers have 
suggested that the 
development of the 
enzyme may have 
followed — or made possible — the increas- 
ing appetite for grain that accompanied the 
growth of agriculture. Scientists also want to 
trace the evolution of alcohol dehydrogenase, 
which is crucial to the breakdown of alcohol 
and could reveal the origins of humanity’s 
thirst for drink. 

Some of the LeCHE participants are now 
probing further back in time, as part of a pro- 
ject named BEAN (Bridging the European and 
Anatolian Neolithic), which is looking at how 
the first farmers and herders made their way 
into Europe. Burger, Thomas and their BEAN 
collaborators will be in Turkey this summer, 
tracing the origins of the Neolithic using com- 
puter models and ancient-DNA analysis in the 
hope of better understanding who the early 
farmers were, and when they arrived in Europe. 

Along the way, they will encounter beyaz 
peynir, a salty sheep’s-milk cheese eaten with 
nearly every Turkish breakfast. It is probably 
much like the cheese that Neolithic farmers in 
the region would have eaten some 8,000 years 
ago — long before the march of lactase per- 
sistence allowed people to drink fresh milk. = 


Andrew Curry is a freelance writer in Berlin. 
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Cell sex matters 


Male and female cells can behave differently — it is time that researchers, 
journals and funders took this seriously, says Elizabeth Pollitzer. 


r Yransplanting muscle-derived stem 
cells into diseased muscle regenerates 
it — a phenomenon that holds major 

potential for human therapies. But for years, 

researchers were puzzled by the unpredict- 
ability of these cells — sometimes they 
would promote fast regeneration, at other 
times none at all. Then, in 2007, a group led 
by Johnny Huard, a stem-cell researcher at the 
University of Pittsburgh in Pennsylvania, hit 


on the rather surprising explanation — sex’, 
Muscle stem cells taken from female mice 
regenerate new muscle much faster than 
those from male mice when transplanted into 
diseased muscle of mice of either sex. 
Researchers have also found that cells 
taken from male and female mice respond 
differently to stress’, and that human cells 
exhibit wildly different concentrations of 
many metabolites across the sexes’. Evidence 
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is mounting that cells differ according to sex, 
irrespective of their history of exposure to 
sex hormones. These differences could have 
major implications for the susceptibility to 
and course of many diseases, their diagnosis 
and treatment. However, most cell biologists 
do not note whether the cells they are using 
come from males or females’. 

Between 1997 and 2001, ten prescrip- 
tion drugs were withdrawn from the 
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> market by the US Food and Drug 
Administration (FDA), eight of which were 
more dangerous to women than to men (see 
go.nature.com/ksindo). The ingredients 
used in non-prescription drugs can also 
pose greater health risks to women. In 2000, 
for instance, the FDA took steps to remove 
phenylpropanolamine, a component of many 
over-the-counter medications, from all drug 
products because of a reported increased 
risk of bleeding into the brain or into tissue 
around the brain in women but not in men. 
Such drug therapies are developed through 
basic research — but what if sex-related 
differences in studied cells contribute in a 
significant way to the observed effects? 

This major potential problem has a rela- 
tively easy solution. Researchers, journals 
and peer reviewers must simply pay atten- 
tion to cell sex, and whether it could affect 
findings. 


YOU'VE GOT MALE 

As a computing expert, the sex of cells has 
never had a direct influence on my research. 
I learned about the problem as director of 
genSET, a project funded by the European 
Union that looked at how gender issues 
are, or should be, addressed in science (see 
www.genderinscience.org). The biggest 
concern turned out to be not inequities in 
hiring or funding, but the absence of gender 
considerations in the design, analysis and 
reporting of bench experiments. 

In 2010, genSET convened a panel of 14 
experts who reviewed more than 120 papers 
from a range of fields — not just cell biol- 
ogy — and produced a consensus report 
with 13 recommendations (see go.nature. 
com/kxnpfn). They call for more thorough 
and systematic inclusion of sex and gender 
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analysis in study design, in the research 
process and in the training of scientists and 
lab managers. 

The fact that sex and gender do matter 
has been recognized for years in clinical 
research, a field in which researchers have 
often disproportionately sampled men. For 
instance, a 2009 review’ of hundreds of clin- 
ical studies in cancer revealed that 75% of 
them under-represented women, suggesting 
that women have less access to experimental 
medicine than men, and might respond less 
effectively to approved drugs. 

The imbalances extend to basic research. 
When the journal Pain reviewed its publi- 
cation record in 2007, 


it found that atleast “If working 
79% of animal studies with only one 
published inthejour- sex or gender, 
naloverthe preceding then specify 
ten years used male that, and 


subjects only, with 
just 8% of studies on 
females only. Another 4% were designed 
explicitly to test for sex differences and the 
rest did not specify®. 

Hormones do not provide the entire expla- 
nation for these sex differences — embryonic 
neurons isolated from male and female rats 
before being exposed to sex hormones have 
different cellular responses to stress. Female 
cells are more sensitive to agents that induce 
programmed cell death (apoptosis), whereas 
male cells react more to oxygen deprivation, 
ischaemia and other forms of stress’. Female 
cells also show greater sensitivity to ethanol 
than do male cells — again, independent of 
hormonal effects’. 

These dimorphisms in cellular behaviour 
may be because of different metabolic path- 
ways. Sex-stratified analyses of genomic data 
that look at the location and expression of the 
genes controlling key enzymes (such as the 
CPS1 gene, involved in protein and nitrogen 
metabolism) have revealed significant differ- 
ences between females and males’. The 
products of metabolic reactions may 

be used as biomarkers for diseases 
such as diabetes and Alzheimer’s, 
but clinicians might not always 
realize that they should expect 
different metabolite levels in 
men and women. 


explain why.” 


WAY FORWARD 
My appreciation of the impor- 
tance of this issue is helped by 
the fact that in my first degree, 
I took courses on molecular and 
cell biology, biochemistry, genet- 
ics and physics. I was introduced 
to the topic of social responsibility 
in science by the molecular biolo- 
gist and physicist Maurice Wilkins, 
then head of the newly created biophysics 
department at King’s College London. The 
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topic of gender in science has since taken an 
exciting form, opening up new directions 
for research, while improving its quality 
and efficacy. 

In November, the third Gender Summit 
will take place in Washington DC under 
the leadership of the US National Science 
Foundation (www.gender-summit.com). 
The summit will feature discussions about 
understanding and addressing sex and 
gender issues in basic and applied science. 

The three-day programme includes a 
wide range of topics delivered by more than 
100 speakers. Attendees will be able to learn 
about the effects that sex differences have 
at the fundamental level of biochemical 
processes involved in cell regulation; about 
dimorphic effects in the regenerative proper- 
ties of stem cells; and about how insights into 
the differences between the metabolic profiles 
of women and men can transform diagno- 
sis, therapy and health economics. This is 
timely: when I attended the 38th Federation 
of European Biochemical Societies Congress 
in St Petersburg, Russia, a few weeks ago, I was 
surprised to see no mention of sex and gender 
differences in the titles of programmed talks. 

The most important thing that scientists 
can do is to acknowledge that there are 
differences between male and female cells, 
and design experiments with this in mind. 
Do not combine male and female cells — 
separate them out, ideally performing the 
same experiment in each group. If working 
with only one sex or gender, then specify 
that, and explain why. Animal and human 
studies should abide by the same rules. 

Peer reviewers, editors and funders can 
help to bring about this change by insisting 
that researchers determine whether and how 
sex and gender could influence their findings. 
After its 2007 analysis, Pain released guide- 
lines that state medication trials must include 
sufficient numbers of men and women. In 
March 2012, the American Physiological 
Society issued a policy requiring identifica- 
tion of sex or gender in papers on animal or 
human studies (see go.nature.com/bmvgiz). 
Every member of the scientific community 
has the power and responsibility to do the 
same for cells. m 


Elizabeth Pollitzer is director of Portia in 
London. Portia is a non-profit organization 
that addresses gender issues in science. 
e-mail: elizabeth@portiaweb.org.uk 
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A golden eagle perches on the glove of a Kazakh hunter in the Altai Mountains, western Mongolia. 


ORNITHOLOGY 


Y eee : tery See 
« + i ihe Sk v os 


BOOKS & ARTS | COMMENT | 


Feathered encounters 


Nicky Clayton enjoys a tribute to the enduring relationship between birds and humanity. 


irds and People is a delightful duet 
Bev nature writer Mark Cocker 

and wildlife photographer David 
Tipling. This monumental compendium is a 
reference book on the relationship between 
the class Aves and humanity — told in part 
through the personal stories of more than 650 
people from 81 countries. But it is also a cof- 
fee-table collection of gorgeous images detail- 
ing that relationship, captured in 39 countries 
across 7 continents. At its core is our fasci- 
nation with birds, and their enchanting and 
elaborate acoustic, visual and aerial displays. 

Cocker vividly depicts how birds have 
become embodied in our arts and science, 
entwined in our philosophy and politics, and 
embedded in our cultures, from folklore to 
costume and cuisine. The eggs of the com- 
mon ostrich (Struthio camelus), for example, 
have been used as water carriers, and birds 
are a central inspiration in Western art: 
think of the ballet Swan Lake or Constantin 
Brancusi’s 1920s sculptural masterpiece Bird 
in Space. 

Part natural history, part cultural anthol- 
ogy, the book describes how and why birds 
have colonized our imagination and become 
an integral part of the everyday, from back- 
garden birdwatching to feather duvets and 
boiled eggs. Images of penguins seem to be 
ubiquitous, appearing on everything from 


— publisher Allen Lane’s 
famous paperbacks 
to Brazil’s Antarctica 
beer. As a devotee 
of crows, ravens and 
other corvids — justly 
celebrated for their 
intelligence in the art 


of Pacific Northwest- 
Birds and People ern Native Americans 
MARK COCKER — Iwas pleased to see 
Jonathan Cape: 2013. behavioural ecologist 
Tim Birkhead punc- 


turing the persistent myth that magpies are 
responsible for the decline of some garden 
bird species. 

There are some 10,500 species of bird and 
they inhabit every continent and almost 
every environment: land, water and the air 
(swifts, for instance, land only to breed). The 
book is organized by avian family, follow- 
ing the taxonomic-order classification and 
nomenclature of ornithologists Frank Gill 
and Minturn Wright. The science on each is 
beautifully plaited into the stories on birds’ 
places in human cultures. 

These stories are staggering in their 
diversity and as gripping for neophytes as 
for ornithologists. Cocker tells us that the 
domestic chicken, derived from the red jun- 
glefowl (Gallus gallus), is the biggest single 
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source of protein we consume. I shed a tear 
over how the black robin (Petroica traversi) 
of the Chatham Islands off New Zealand was 
pulled back from extinction, thanks to the 
dedication of scientist Don Merton and Old 
Blue, once the last remaining fertile female, 
which Merton found. Human-bird coopera- 
tion is an ancient phenomenon, of course: in 
East Asia, some fishermen encourage free- 
flying cormorants to catch fish. The birds 
deposit these on their ‘owner’s’ boat, and are 
fed with some of the fish as a reward. 

Numerous books have been written on the 
interactions between animals and human- 
kind; few investigate cultural perspectives. 
Of those that do, the tendency is to focus 
on a species or family. Birds and People is 
refreshing in not being speciesist. Cocker 
and Tipling explore a relationship that lies at 
the heart of human experience and that has 
inspired great science and art, from Charles 
Darwin's groundbreaking findings on the 
Galapagos finches to Percy Bysshe Shelley’s 
soaring 1820 poem To a Skylark. = 


Nicky Clayton is the professor of comparative 
cognition in the Psychology Department 

at the University of Cambridge, UK, and 
scientist in residence at Rambert, the UK’s 
flagship contemporary dance company. 
e-mail: nsc22@cam.ac.uk 
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ENVIRONMENT 


Blanking out the mess 


Edward Humes explores a study of the psychological dissociation plaguing our 


relationship with nature. 


cc hy do people make choices 
harmful to the environment 
even when they truly care 


about it?” This is the deceptively simple 
premise of Invisible Nature, in which envi- 
ronmental researcher Kenneth Worthy 
offers psychological, cultural and philo- 
sophical explanations for our curious and 
contradictory relationship with nature. 

It is a thought-provoking, if frustrating, 
exploration. Like many of us who write on 
the environment, Worthy has an easier time 
diagnosing the ailment than prescribing 
a plausible cure. Still, his book offers read- 
ers concerned about humanity’s anaemic 
response to the crises of climate, waste and 
pollution a fresh starting point from which 
to consider another vital question: what next? 

Worthy points to ‘dissociation’ as the 
key malady, particularly in the West. Mod- 
ern economies, societies and lifestyles, he 
argues, put time, distance and disinformation 
between our everyday choices and their envi- 
ronmental consequences, such as acid rain 
caused by factory emissions. That consumer 
choices are destroying vital natural resources 


26 | NATURE | VOL 500 | 1 AUGUST 2013 


isa factor rarely experienced directly and vis- 
cerally. We roll our rubbish to the kerb and it 
‘magically’ disappears, never requiring us to 
smell a refuse dump or see the accumulation 
of plastic in oceans. Our experience of nature 
is limited to curated parks and street trees. 
Therein, Worthy writes, lies our downfall. 
He offers eye-opening illustrations of the 
harm enabled by dissociation. Many con- 
sumers know that the silicon ‘brains’ inside 
their beloved smartphones are produced in 
clean, airtight rooms by workers wearing 
immaculate protective suits. Few realize 
that those suits are designed to protect the 
chips, not the workers. Toxic chemicals soak 
through, causing myriad health problems. 
In the most fascinating part of Invis- 
ible Nature, Worthy takes his enquiry in a 
more original direction by examining the 
psychology of dissociation. He draws envi- 
ronmental lessons from the landmark 1960s 
experiments of social psychologist Stanley 
Milgram, in which students administered 
what they believed were powerful electric 
shocks to a person in the next room when 
urged to do so by an authority figure, despite 
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hearing the person’s screams and pleas. The 
students’ destructive obedience has been 
partly attributed to dissociation. The per- 
centage of students willing to administer the 
largest shocks plummeted when they were in 
the same room as their victims, and fell even 
lower when they had to force their victims 
hands against a shock plate. The tendency of 
people acting as agents ofa trusted author- 
ity to commit acts they 
might never perform 
alone is another sort of 
dissociation: of action 

from responsibility. 
Worthy contends 
that Milgram’s results 
— and those of the 
L/ 20 subsequent obe- 
: dience experiments 
modelled on his work 


Invisible Nature: 


Healing the _ id ho- 
: provide a psycho 
Destructive logical explanation 
Divide Between fae icles 
People and the Or Wily People OW= 


ingly make environ- 
mentally destructive 
choices. He also draws 
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KENNETH WORTHY 
Prometheus: 2013. 
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on the genocide studies of psychologist 
Ervin Staub, who suggests that psycho- 
logical barriers to admitting that past acts 
were wrong encourage further destructive 
choices. 

Worthy’s argument begs many questions. 
Which variables lead to fewer destructive 
choices? Is detailed knowledge of environ- 
mental destruction enough, or is direct expe- 
rience needed? Can trusted authority figures 
enforce ‘constructive obedience, prodding 
people toward environmentally benign poli- 
cies? This approach has worked in numerous 
communities worldwide, where consumers 
have abandoned disposable plastic shopping 
bags, urged on by civic leaders, elected offi- 
cials and celebrity spokespeople. Most of all, 
I wanted Worthy to relate the thoughts and 
rationale of real people who chose a hybrid 
car or a gas guzzler, a reusable bag or a plastic 
sack. Why did they make those choices, and 
what would it take to alter them? 

Instead, Invisible Nature switches to a 
lengthy, dry attack on Plato, Aristotle and 
Descartes as the philosophical fathers of 
our estrangement from nature. Oddly, 
these historical figures are the book’s only 
fully developed characters. The sole mod- 
ern human is Worthy’s fictional character, 
Joe the accountant, a “typical office worker” 
who “spends most of his time with symbols, 
concepts, and figures that are abstractions of 
material-world things” Joe pops up now and 
then when the author needs a character who 
drinks from plastic water bottles or who is 
overly proud of his smartphone. But, being a 
caricature, he can't explain his choices — and 
that’s too bad. 

The book concludes with a disappoint- 
ingly familiar list of ways to enhance our 
engagement with nature: cycling more; 
planting a vegetable garden; and embracing 
Eastern philosophies that depict nature and 
humanity in a mutually dependent context. 

Invisible Nature could have been a great 
book had Worthy focused more on the psy- 
chology of dissociation and less on long- 
dead philosophers. Modern thinking on 
marketing, globalism and consumerism 
is more relevant to current environmental 
problems than ruminations on mind-body 
dualism. After all, it was J. Gordon Lippin- 
cott, father of corporate branding, who in 
1947 set out the axiom of modern, environ- 
mentally destructive consumerism: “Our 
willingness to part with something before 
it is completely worn out is a phenomenon 
noticeable in no other society in history ... 
It must be further nurtured even though it 
runs contrary to one of the oldest inbred 
laws of humanity, the law of thrift” m 


Edward Humes is a Pulitzer prizewinning 
journalist and the author of 12 non-fiction 
books. His most recent is Garbology: Our 
Dirty Love Affair With Trash. 
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in brief 


Island Life: Or, the Phenomena and Causes of Insular Faunas 

and Floras, Including a Revision and Attempted Solution of the 
Problem of Geological Climates 

Alfred Russel Wallace, with Lawrence R. Heaney UNIVERSITY OF CHICAGO 
Press (2013) 

Co-discoverer of natural selection Alfred Russel Wallace pioneered 
island biogeography — the study of how organisms are distributed 
on isolated land masses. In fact, he was the field’s Kepler, Linnaeus 
and even Chuck Berry, avers David Quammen in his foreword to this 
facsimile of Wallace’s landmark 1880 study (see also Nature 496, 
165-166; 2013). Includes a commentary by Lawrence R. Heaney. 


The Mind Within the Brain: How We Make Decisions and How 
Those Decisions Go Wrong 

A. David Redish OXFORD UNIVERSITY PRESS (2013) 

From drinking that fifth Martini to choosing a career, decisions 
shape our lives. In his study of the science of decision-making, 
neuroscientist David Redish keeps a cool head in an overheated 
theoretical arena. Synthesizing findings in psychology, robotics and 
more, he takes us from brain physiology to aspects of mind. His own 
decision-making model involves multiple ‘action-selection’ systems, 
failures in which can lead to behaviours such as compulsive 
gambling. Well researched and delivered with a light touch. 


Reality Check: How Science Deniers Threaten Our Future 

Donald R. Prothero INDIANA UNIVERSITY PRESS (2013) 

The shabby tactics of science denialists sparked this astute 
exposition of what we lose when science is sidelined. Palaeontologist 
Donald Prothero lays out key cases, such as former US President 
Ronald Reagan’s misbegotten Strategic Defense Initiative and the 
demonization of biologist Rachel Carson for her work on pesticides. 
The usual suspects — from creationists to anti-vaccinationists — are 
examined too. Ultimately, Prothero argues, by claiming that clear 
findings on climate change, biodiversity loss and overpopulation are 
false, deniers inch us closer to catastrophic planetary tipping points. 


The Sports Gene: Inside the Science of Extraordinary Athletic 
Performance 

David Epstein CURRENT (2013) 

Sports-science writer David Epstein enters the nature-nurture 
debate on elite sporting ability. Drawing on interviews with sports 
scientists and leading athletes, Epstein looks at psychologist Anders 
Ericsson’s ‘10,000-hour’ study on how practice can masquerade 
as innate talent; gender testing and chromosomal anomalies; 

the heritability of height; and more. His conclusion that sporting 
excellence is down to the interplay of luck, intent and effort hardly 
surprises, but this is a rollicking ride through the latest science. 


The Making of the Mind: The Neuroscience of Human Nature 
Ronald T. Kellogg PROMETHEUS (2013) 

The genetic overlap between Homo sapiens and the chimpanzee 

is a biological conundrum. Some primatologists focus on what we 
share; cognitive psychologist Ronald Kellogg looks at what makes 
the human brain distinct. He argues that it is the interaction of 
cognitive elements such as our complex social intelligence and 
comprehension of others’ mental intentions; symbolic thought and 
language; and a sense of time. Kellogg ends by speculating about 
the shaping of a “post-literate, Internet mind”. Barbara Kiser 
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A mannequin covered in letters denoting DNA bases draws visitors to an interactive exhibit in Genome. 


The big blueprint 


Gene Russo surveys a high-tech exhibition celebrating 
the double helix and the human genome sequence. 


wo landmark dates in biology sparked 
| Genome: Unlocking Lifes Code — the 
completion of the human genome 
10 years ago, and the unveiling of the dou- 
ble helix by James Watson and Francis Crick 
60 years ago. The exhibition, currently at the 
Smithsonian National Museum of Natural 
History in Washington DC, hasa tough task: 
to render the abstractions and complexi- 
ties of genomics a feast for the senses. That 
microscopic, interiorized world presents a 
curatorial challenge light years away from 
displaying Tyrannosaurus rex skeletons. 
Yet Genome — a Smithsonian Institution 
collaboration with the US National Human 
Genome Research Institute in Bethesda, 
Maryland — does engage, even if the content 
is sometimes a bit dense. Videos, genomics 
trivia and interactive screens navigated by 
track pad help to explore everything from 
bioethical quandaries to genomic testing. 
One section marks the two big anniversa- 
ries with films featur- 
ing, among others, 
Watson recalling the 
1953 discovery and 
National Institutes 


> NATURE.COM 
For a PastCast on the 
‘other’ DNA papers: 
go.nature.com/lizfik 
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Genome: of Health director 
Unlocking Life’s Francis Collins tout- 
Code 


ing the importance 
of making genomic 
data freely available. 
As lead curator Meg 
Rivers puts it, the idea 
is to reveal to visitors 
that genomics is ‘all about them: “The big- 
gest challenge is saying what a genome is, and 
making it meaningful,” she says. 

There are three thematic areas, covering 
genomics’ influence on health and medicine, 
genes and ancestry, and genomic connec- 
tions among living things. Any one of these 
could make an entire exhibition on its own. 

An animation on a large LCD screen 
nicely explains the inner workings of DNA, 
genes and proteins. Factoids flash on-screen. 
“Unfolded, DNA is six feet long,” we're told. 
“Yet it’s 30,000 times thinner than a strand of 
human hair” A section on next-generation 
genomic medicine focuses on combating 
disease, personalized drug treatments and 
DNA sequencing’s role in halting the spread 
of deadly infections. “How much do you 
want to know about your family’s genetic 


Smithsonian National 
Museum of Natural 
History, Washington DC. 
Until 1 September 
2014, then on tour. 
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make up?” asks one interactive panel. 

To learn about ancestry, you can drag an 
arrow across a timeline on a touch-screen 
world map to see how humankind migrated 
and how genetic diversity has changed over 
tens of thousands of years. Humans started 
out in Africa, but migrants took along only 
a portion of the genetic diversity — today, 
Africans harbour a more diverse gene pool 
than other populations. 

Another section asks what we can glean 
from the genomes of other life forms. Select- 
ing the horseshoe crab ona screen reveals that 
the species has not changed for 200 million 
years. ‘Is it a living fossil?’ it asks. No, we dis- 
cover: only a small part of the genome con- 
trols body shape; the rest of the genome has 
continued to evolve. 

Questions posted on the wall at the 
entrance ask visitors to describe their physi- 
cal features and then reveal what proportion 
of people share their particular combination. 
‘Do you have dimples?’ No. ‘Are the lobes of 
your ears attached?’ Yes. ‘Can you curl your 
tongue?’ No. ‘Is your hairline pointed in the 
center?’ No. Conclusion: “4% of 292 people 
polled share these four physical traits with 
you? Another part of the exhibit probes 
thorny ethical dilemmas. For example, 
should we place limits on genomic research? 
A touch-screen prompt then asks the user 
to consider factors such as facilitating faster 
cures and getting permission from subjects. 

To keep the show feeling fresh, a news 
ticker scrolls the most recent genetics- 
related stories onto a wall. On the day of 
the press preview, for example, the Supreme 
Court had just come down with its ruling 
against Myriad Genetics on patenting genes. 

My main quibble is that Genome offers 
little to younger children, in a museum 
receiving 7 million visitors annually. The 
omission is even odder given that another 
exhibition has already shown the way. In 
2001, the American Museum of Natural 
History in New York staged The Genomic 
Revolution. It featured a stack of 180 Man- 
hattan telephone books, 1,646 pages each, in 
the shape of a double helix; the number of 
characters in them roughly equals the num- 
ber of base pairs in DNA. Smaller children 
also loved the ‘mutation station, where you 
could ‘change a gene’ by turning arung ona 
giant plastic double helix. A television screen 
showed the physical effects in a fruit fly. 

In this age of iPads and touch-screen 
technology, one might have expected a 
more creative approach than Genome’s final 
draft — perhaps a big touch screen allowing 
users to drag and match ‘As’ with “Ts’ and ‘Cs’ 
with ‘Gs’ on an animated double helix. Such 
a presentation might have prompted young 
and old alike to think about this thing in all 
of us called a ‘genome’ m 


Gene Russo is editor of Nature Careers. 
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Reward research 
outreach in Japan 


The Japan Science and 
Technology Agency last month 
reported that only 64% of 
almost 9,000 surveyed Japanese 
scientists communicate results 
outside the scientific community. 
This is surprising: since 2010, 
the government has urged the 
researchers it funds to improve 
communication with the 
tax-paying public — particularly 
after the March 2011 tsunami and 
nuclear accident at Fukushima. 

Of the 36% of researchers who 
did not participate in outreach, 
many cited administrative 
overload and a lack of resources 
and time. The biggest issue 
among these scientists was that 
they felt there was no personal 
advantage in communicating 
their results to the public. This is 
because researcher performance 
in Japan is evaluated solely on the 
number of journal publications, 
so time and effort spent on 
science communication will not 
help scientists to secure funding, 
promotion or employment. 

The government needs 
to recognize that scientific 
outreach activities are valid 
experiences for funding 
and job applications. Japan's 
researchers, in turn, should 
recognize that communicating 
with the public provides its own 
reward in the form of society's 
acknowledgement of their work. 
Amane Koizumi National 
Institute for Physiological 
Sciences, Aichi, Japan. 
amane@nips.ac.jp 
Yuko Morita National Museum of 
Emerging Science and Innovation 
(Miraikan), Tokyo, Japan. 
Shishin Kawamoto Hokkaido 
University, Hokkaido, Japan. 


Flanders overrates 
impact factors 


A renewed decree on research 
funding last year by the 
government of Flanders in 
northern Belgium advised that 
40% of research evaluation 


should be based on bibliometric 
data. This involves a complex 
calculation that includes the 
number of publications and 
citations, and the impact factors 
for the journals of publication 
(see go.nature.com/mt9srg; in 
Dutch). We question the merits of 
this strategy, given the debatable 
value of impact factors in gauging 
research quality (see, for example, 
B. Alberts Science 340, 787; 2013). 

The Flemish Interuniversity 
Council in fact advised Flemish 
universities in 2010 to reduce the 
importance of bibliometric data 
when weighing up researchers for 
appointment or promotion. In 
practice, however, such data have 
become even more important. 

It is time to heed the widely 
recognized risk of overrating the 
quantity of scientific publications, 
which can compromise research 
quality and integrity. 

Karen Stroobants, Simon 
Godecharle Research 
Foundation Flanders (FWO) and 
University of Leuven, Belgium. 
karen.stroobants@chem.kuleuven.be 
Sofie Brouwers FWO and Vrije 
Universiteit Brussel, Belgium. 


Time-lapsed awards 
for excellence 


The number of scientific 
prizes has proliferated in the 
past 20 years (see Nature 498, 
152-154; 2013). But oncea 
scientist has published a seminal 
contribution, how long is it 
before these glittering symbols of 
recognition come through? 
Occasionally, recognition 
comes early ina scientific career. 
For example, the biologist 
Thomas Henry Huxley was 
elected as a fellow of the Royal 
Society in London in 1851 at 
age 26, and William Lawrence 
Bragg won the Nobel Prize in 
Physics in 1915 when he was just 
25. But these may be exceptions. 
We looked at 466 recipients 
of Nobel prizes in physics, 
chemistry, and physiology or 
medicine, awarded from 1901 
to 2000, using a biographical 
encyclopedia that also lists 


all other major institutional 
awards won by an individual 

(G. T. Kurian The Nobel 
Scientists, Prometheus; 2002). 
These include prizes such as the 
Davy Medal, the Max Planck 
Medal and the Canada Gairdner 
International Award. 

Using the year in which 
laureates produced their 
Nobel-prizewinning work as an 
indicator for the timing of their 
peak scientific achievement, we 
calculated the average lag time in 
each field between this and the 
timing of the Nobel prize and 
other major awards. 

Our investigations indicate 
that recognition is conferred 
relatively rapidly. In physics, the 
lag was just 5 years; in chemistry, 
9 years; and in physiology or 
medicine, it was 11 years. It 
seems that important discoveries 
in physics, and perhaps in 
chemistry, are more easily 
defined than in physiology or 
medicine, so their merits are 
more swiftly recognized. 

Ho Fai Chan, Benno Torgler 
Queensland University of 
Technology, Brisbane, Australia. 
benno.torgler@qut.edu.au 


Joint Indian initiative 
creates tiger corridor 


India harbours 60% of the world’s 
wild tigers but, as elsewhere, 
populations are declining 
because of habitat destruction. 
An initiative between an Indian 
state government and civil 
society is set to counteract this 
trend, and offers an instructive 
conservation model. 

Karnataka state in southern 
India has implemented several 
effective tiger-conservation 
strategies. However, the protected 
areas that shelter tigers are 
disjointed and isolated, and 
tiger populations are becoming 
increasingly fragmented. 

Habitat connectivity between 
populations is essential for the 
tiger’s future survival. It aids 
dispersal of tigers from source 
sites, enables genetic exchange 
between populations and helps to 


maintain meta-populations. 

In response, the state 
government is actively linking 
tiger populations by identifying 
ecologically important habitats 
and designating them as 
protected areas to ensure long- 
term habitat protection. 

For example, in May 2013 
it established the 906-square- 
kilometre Malai Mahadeshwara 
Wildlife Sanctuary, creating a 
6,500-km? contiguous network 
of protected areas — the largest 
in the country. Over the past 
19 months, the state government 
has added nearly 3,000 km’ of 
tiger habitat to the network. 

This extended landscape 
should be able to host a connected 
population of some 500 tigers. 
Sanjay Gubbi Panthera, 

New York, USA; and Nature 
Conservation Foundation, 
Mysore, India. 
sanjaygubbi@gmail.com 

H. C. Poornesha Nature 
Conservation Foundation, 
Mysore, India. 


Comparing science 
and music is unsound 


Stephane Detournay suggests 
that both scientific research and 
music composition undergo 
phases of “onset, development, 
refinement and exposition” 
(Nature 499, 245; 2013). But so 
do hedge-fund management and 
tomato farming. 

One thing that binds music 
and science is the idea that 
sincere, personal investment 
in one’s work is good. Yet 
music is mainly about evoking, 
interpreting and savouring 
emotions, whereas science is 
essentially the pursuit of truth. 
Justin Jee New York University, 
New York, USA. 
justin.jee@med.nyu.edu 
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OBITUARY 


Kenneth Geddes Wilson 


(1936-2013) 


Nobel-prizewinning physicist who revolutionized theoretical science. 


efore Kenneth Wilson’s work, 
B calculations in particle physics were 

plagued by infinities. Results came 
from a workaround called renormalization, 
dubbed “ugly” even by one of its inven- 
tors, theoretical physicist Paul Dirac. In the 
1970s, Wilson reformulated this method. 
Almost immediately, renormalization 
became a respectable and widely used tool, 
forming the basis of thousands of papers in 
condensed-matter and particle physics. It 
is now our primary method for seeing the 
connections among different theories in the 
physical sciences. 

Wilson died on 15 June 2013. Born in 
1936 in Waltham, Massachusetts, he started 
life amid theoretical science. His grand- 
father taught engineering at the Massachu- 
setts Institute of Technology in Cambridge; 
his father was a theoretical chemist at 
Harvard University, also in Cambridge. Like 
his father, Kenneth Wilson was appointed to 
the Society of Fellows at Harvard, a group of 
young scholars picked for their exceptional 
promise and then given no responsibilities 
whatsoever. 

Ken got his PhD in particle physics under 
Murray Gell-Mann at the California Insti- 
tute of Technology in Pasadena. Appointed 
to the junior faculty at Cornell University 
in Ithaca, New York, in 1963, Wilson ini- 
tially proved a disappointment, at least to 
those who vet promotions by bean count- 
ing. Despite having no journal publications 
before 1969, Wilson's exceptional prom- 
ise brought him promotion to tenure at 
Cornell after just two years. Then, in 1971, 
Wilson revolutionized the mathematical 
sciences. 

Both particle and condensed-matter phys- 
ics describe observations at length scales set 
by the experimental apparatus, but depend 
on processes occurring at much shorter 
distances. So observation-based theories, 
whether they are about the elasticity of 
materials or the collisions of observable 
‘elementary’ particles, cannot accurately 
describe natural forces acting at small scales. 

This weakness showed up in mid-twen- 
tieth-century particle physics through the 
generation of infinities in most calculations. 
Before Wilson’s work, the renormalization 
method achieved finite results by replacing 
the infinite quantities in the theory with 
empirically derived, finite quantities. 

During the 1960s, Wilson noted the close 
analogy between particle physics and phase 
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transitions. (A phase transition is the change 
of matter from one form to another, such 
as the boiling of liquid water to produce 
steam.) Phase transitions had a rich experi- 
mental, analytical and numerical tradition, 
based on well-known science at the atomic 


or molecular level. As a result, a theory in 
this area could be verified or disproved by 
comparison with known results. But a theory 
was needed to bridge the gap between basic 
forces at one length scale and observations 
at avery different scale. 

Ken built a new form of analysis for phase 
transitions. He focused on theories defined 
at different length scales, broadening the 
renormalization calculation of the connec- 
tion between the scales to include all physical 
processes, not just the few that might show 
infinities. He added an analysis in which the 
eventual result of many changes in length 
would be a ‘fixed point. This result was easy 
to analyse because of its scale-independent 
behaviour. 

Soon after Wilson first described his 
ideas, he worked with Michael Fisher, also at 
Cornell, to calculate the mathematical form 
of the interrelation among density, pressure 
and temperature for a fluid hovering close 
to its transition from liquid to vapour. The 
amazingly close agreement of their results 
with experimental findings helped to 
accelerate the acceptance of the new renor- 
malization theory. 

Wilson also applied his ideas to the 


© 2013 Macmillan Publishers Limited. All rights reserved 


quarks that underlie a substantial portion of 
high-energy theory. He pointed out that 
these quarks can fall into several thermo- 
dynamic phases, differing in behaviour. In 
the phase that is found all around us, quarks 
become hard to observe because they are 
closely bound to one another. This work 
was important for understanding the basic 
theory of quantum chromodynamics and 
its realization in numerical computations, 
described as lattice gauge theory. This work 
founded a subfield of science, now practised 
at many centres. 

In 1982, Wilson won a Nobel prize for 
his work on phase transitions. However, 
his renormalization has had an impact that 
is substantially beyond any single area of 
science. One might argue, as I do, that the 
connections among ‘laws of nature’ at differ- 
ent scales of energy, length or aggregation is 
the root subject of physics. It could then be 
said that Wilson has provided scientists with 
the single most relevant tool for understand- 
ing the basis of physics. 

That was Wilson’s’s main accomplish- 
ment. But it was not all. His work drove us 
to develop different work styles and new 
scientific areas. He implicitly encouraged 
us to emulate computers and computer 
programs in order to track all the different 
processes that arise in changing length scales. 
Wilson was influential in his championing 
of supercomputers and the National Science 
Foundation’s national scientific supercom- 
puting centres, one of which was established 
at Cornell in 1985. He constructed computer 
programs to enable the flexible use of very 
large computers for research. This work was, 
in part, carried out with his wife, computer 
scientist Alison Brown. 

Ken was unfailingly generous to those 
working in his area. For example, he gave 
very careful credit in his papers to previ- 
ous workers. I note Ken’s kind (and almost 
unprecedented) inclusion of my name in the 
title of his first, great, renormalization paper. 

The brilliance of Kenneth G. Wilson was 
dazzling, but he never tried to outshine those 
about him. He was all quiet competence and 
deep accomplishments. m 


Leo P. Kadanoff is a theoretical physicist 
and applied mathematician at the Perimeter 
Institute, Waterloo, Ontario, Canada; and 
at the James Franck Institute, University of 
Chicago, Illinois, USA. 
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REGENERATIVE BIOLOGY 


On with their heads 


Data from three teams show that alteration ofa single cell-signalling mechanism can unlock the latent head-regeneration 
potential in normally regeneration- deficient species of flatworm. SEE LETTERS P.73, P.77 & p.81 


ANDRAS SIMON 


oltaire was not only a leading philoso- 
\ / pher of his time but, inspired by natural 
scientists, he also carried out intrigu- 
ing experiments that showed that decapitated 
snails could regenerate their heads. On the 
basis of these observations, Voltaire was said 
to be optimistic about humans being able to do 
the same one day’. Although the arrival of that 
day is apparently not imminent, clues to how 
to unlock a latent potential for head regrowth 
are presented in three papers* in this issue” *, 

Some animals regenerate organs and even 
body parts following major injuries. The 
distribution of species with these abilities is 
seemingly random: representatives range from 
relatively simple organisms, such as hydra, to 
the more complex flatworms, and on to verte- 
brates such as salamanders”. The regenerative 
capacity of these organisms is quite remark- 
able. For example, some flatworms can be 
sectioned into small pieces, each of which 
will give rise to a new, complete worm’. And 
salamanders, after amputation, regrow entire 
limbs that are fully functional, perfect copies 
of the original extremity’. 

Why some animals but not others respond 
to injury with such extensive regeneration is 
an outstanding question both from an evo- 
lutionary perspective and when considering 
possible interventions to enhance regenerative 
capacity. One way to address this question is to 
compare closely related species with differing 
injury responses. The three new papers all take 
this approach, by studying highly regenerative 
and regeneration-deficient flatworms. Two flat- 
worm species, Dugesia japonica and Schmidtea 
mediterranea, can regenerate all missing body 
parts after transection, irrespective of how many 
times or where they are cut along the head-to- 
tail (anterior-to-posterior) axis. By contrast, the 
three non-regenerative species investigated in 
the studies reported here — Phagocata kawakat- 
sui by Umesono et al.’ (page 73), Procotyla flu- 
viatilis by Sikes and Newmark’ (page 77) and 
Dendrocoelum lacteum by Liu et al.’ (page 81) 
— all fail to regenerate heads from the tail frag- 
ment left after transection through the most- 
posterior third of their bodies (Fig. 1). 


*This article and the papers under discussion?“ were 
published online on 24 July 2013. 
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Figure 1 | Dendrocoelum lacteum. These flatworms, the subjects of Liu and colleagues’ study’, are 
unable to regenerate a head from a tail fragment. 


What mechanisms underlie these differ- 
ences? Regeneration normally starts by wound 
healing and involves the proliferation of stem 
cells called neoblasts. These cells give rise to a 
cell mass called a blastema, which subsequently 
grows and differentiates into a fully patterned 
new body part. It seems that regeneration- 
deficient species respond to an injury with 
normal wound healing and cell-proliferation 
dynamics. This extended to blastema formation 
in the tail fragments of D. lacteum’, but this was 
not observed in P. fluviatilis’. Thus, the regen- 
erative process seems to stop at or just before 
blastema formation. However, given the paucity 
of biological markers for blastemas, some cau- 
tion about these conclusions may be warranted. 

Importantly, the papers converge on the 
identity of a molecular signalling pathway that 
should be targeted to evoke head regeneration 
from tail fragments in all three regeneration- 
deficient species — the Wnt/B-catenin pathway, 
which plays a key part in several processes dur- 
ing embryonic development and adult homeo- 
stasis in multicellular organisms. Wnt proteins 
cause accumulation of §-catenin, which in turn 
regulates gene expression’, and perturbation 
of Wnt/B-catenin signalling is associated with 
developmental malformations, inappropriate 
cell turnover during adult homeostasis and 
tumour formation’. The common conclusion 
of the present studies is that downregulating the 
activity of this pathway by reducing b-catenin 
levels leads to head regeneration from other- 
wise regeneration-deficient tail fragments. 
Heads regenerated through this approach had 
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morphologically normal brains and contained 
photoreceptors — the ‘simple eyes’ of these 
animals. Furthermore, regenerated animals 
displayed normal behavioural responses to light 
and reacquired feeding ability. 

The significance of Wnt/B-catenin signalling 
in regeneration has previously been highlighted 
in regeneration-competent flatworms. In these 
animals, reduced activity of this pathway leads 
to outgrowth ofa head instead ofa tail from pos- 
terior-facing wounds (resulting in a two-headed 
animal) and, conversely, enhanced activation of 
the pathway by stabilization of B-catenin leads 
to tail instead of head outgrowth from anterior- 
facing wounds (giving a two-tailed animal)”°. 
In agreement with these observations, two of 
the present studies** found that stabilization 
of B-catenin also results in tail outgrowth from 
tail fragments in the normally non-regenerative 
P fluviatilis and D. lacteum. 

A possible interpretation of these combined 
findings is that achieving polarity along the 
head-to-tail axis is required for injury responses 
to proceed to the creation of a new body part, 
and that the inability to progress past blastema 
formation to regenerate a head is the result of 
insufficient polarity in Wnt/B-catenin signal- 
ling (Table 1). The next question is how Wnt/B- 
catenin signalling creates axial polarity. Data 
from Umesono and colleagues’ study” indicate 
that another pathway, ERK signalling, normally 
forms an activity gradient that decreases from 
head to tail, and the authors suggest that the 
Wnt/B-catenin pathway exerts its effects by 
suppressing ERK signalling. According to their 
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TABLE 1 | SIGNALLING AND REGENERATION 


Effects of altered Wnt/B-catenin signalling in regeneration-deficient flatworms 


Wnt/B-catenin signalling Axial polarity Regeneration from tail fragment 
Natural (deficient) regulation Insufficient None 

Downregulated signalling Correct Head regeneration 

Upregulated signalling Reversed Tail regeneration 


model, P. kawakatsui fails to regenerate a head 
from tail fragments because head regeneration 
requires high levels of ERK signalling and this 
is prevented by Wnt/B-catenin activity. It will 
be interesting to further dissect these putative 
molecular interactions and determine whether 
the model also holds true for other regenera- 
tion-deficient worms. 

How should we consider the evolutionar- 
ily uneven distribution of regenerative ability, 
and what are the implications of the current 
findings for regeneration in mammals? Is 
it possible, as these studies might indicate, 
that regeneration is an inherent trait that is 
manifest in some organisms but silent in oth- 
ers, such that reshuffling the interactions of 


CLIMATE SCIENCE 


existing molecular components is sufficient for 
its awakening? Alternatively, is regeneration a 
capability that has been acquired by some spe- 
cies, possibly in a process that requires species- 
specific molecular components’? 

Inducing regeneration of a body partin a spe- 
cies that cannot undergo full regeneration but 
is in many aspects regeneration-competent is 
quite different from achieving something com- 
parable in largely non-regenerative organisms, 
which include most mammals. Nevertheless, 
the new studies highlight the importance of 
cross-species comparisons for understanding 
the molecular mechanisms of regeneration and 
identifying processes that prevent it from occur- 
ring. Moreover, the finding that manipulation 


Unequal equinoxes 


Innovative measurements of ocean turbulence show that mixing of cold water 
from below makes the surface of the equatorial Pacific much colder in September 


than in March. SEE LETTER P.64 


SHANG-PING XIE 


quatorial countries such as Indonesia 
EE Ecuador are associated with thun- 

der showers and lush rainforests. In 
September 1835, when Charles Darwin set 
foot on the Galapagos Islands, an archipelago 
on the Equator some 1,000 kilometres west of 
the Ecuadorean coast, he found none of these. 
Instead, he saw very dry land and leafless trees 
“in an intertropical region, but without the 
beauty which generally accompanies such 
a position”'. The cold surface sea water that 
keeps the Galapagos dry is part of a basin-scale 
structure called the equatorial cold tongue, and 
is known to originate from beneath the surface. 
But how the cold deep water reaches the surface 
has never been satisfactorily measured through 
a seasonal cycle, because of the prohibitive costs 
of operating ships in this remote ocean. 

On page 64 of this issue, Moum et al.’ report 
multi-year measurements of ocean turbulence 
using a revolutionary instrument called an 
x-pod, which is moored in the ocean (Fig. 1)*. 
They show that small ocean eddies of diam- 
eter less than 1 metre bring cold water from the 


*This article and the paper under discussion’ were 
published online on 24 July 2013. 


thermocline layer (the ocean layer that sepa- 
rates the warm upper part of the ocean from 
the cold deeper part) to the sea surface. The 
new instrument has finally clinched the con- 
nection between turbulence and climate, and 
holds the promise of improving global climate 
models. 

In explaining the absence of coral reefs 
around the Galapagos Islands, Darwin cal- 
culated that sea surface temperature (SST) 
was 20°C during his 35-day stay’, close to the 
modern September climatology (21° C) for 
this region, but much lower than in the west- 
ern Pacific (29°C) at this time of the year. Had 
Darwin come to the islands in March, he would 
have been welcomed by trees covered with 
green leaves. SST in March reaches nearly 27°C, 
warm enough to permit rain showers. The pro- 
nounced annual cycle in SST is a mystery given 
our experience that the seasonal change in tem- 
perature is caused by the local change in solar 
radiation. The Sun marches across the Equator 
twice a year, at the March and September equi- 
noxes, yet the SSTs between the two months 
differ by a whopping 6°C. What makes Septem- 
ber cooler than March in the Galapagos? 

Because of Earth’s rotation, easterly winds 
force upward motion in the Pacific Ocean on 
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ofa single major signalling pathway is sufficient 
to induce regeneration of a body part gives 
hope for the future that we will be able at least 
to improve regenerative responses in humans. = 


Andras Simon is in the Department of Cell 
and Molecular Biology, Karolinska Institute, 
17177 Stockholm, Sweden. 

e-mail: andras.simon@ki.se 
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the Equator that brings the cold water from 
the thermocline close to the surface, a process 
called upwelling. Likewise, southerly winds 
cause upwelling slightly (100-200 km) south 
of the Equator’. The climatic axis of symmetry 
turns out not to be on the geographic Equator, 
but 1,000 km to the north in the intertropical 
convergence zone where the southern and 
northern trade-wind systems converge to pro- 
duce frequent thunder showers. Because the 
intertropical convergence zone is displaced 
north of the Equator, the Equator belongs to 
the Southern Hemisphere from a climatic per- 
spective, being warm in March and cold in Sep- 
tember’. In September, the seasonal warming 
of the Northern Hemisphere and cooling of the 
Southern Hemisphere accelerate the southeast 
trade winds, intensifying ocean upwelling. In 
March, the seasonal warming of the Southern 
Hemisphere weakens, but does not reverse the 
direction of the southeasterly cross-equatorial 
winds, suppressing ocean upwelling. Thus, the 
southeast trade winds attain an annual cycle in 
speed that drives annual variations in ocean 
upwelling and SST near the Equator’. 
Upwelling pushes the thermocline close to 
the surface but it takes turbulence at sub-metre 
scales to mix the cold thermocline water into 
the surface layer. Conventionally, measure- 
ments of such ocean mixing are made by drop- 
ping microstructure profilers from ships, but 
nobody can afford to station a research vessel 
on the Equator for a year — the minimum 
length of time needed to unlock the mystery 
of the equatorial annual cycle. Moum and col- 
leagues invented y-pods for deployment on 
moored buoys’ to measure the dissipation rate 
of turbulent temperature variance (x) for up 
to a year at a time. x; is a measure of turbulent 
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Figure 1 | Tropical Pacific climate. The image shows the September sea- 
surface temperature of the tropical Pacific, along with wind velocity (white 
arrows) on the equator and the location of atmospheric convection (cloud 


mixing intensity, from which one can calculate 
turbulent heat flux in the vertical direction. 
Their data, which span 6 years on the Equa- 
tor at 140° W, show that the turbulent heat flux 
from below peaks in August and is the domi- 
nant mechanism for the rapid surface cooling, 
finally confirming a long-held hypothesis that 
ocean mixing drives the equatorial annual cycle. 

Thus, the equatorial cold tongue (Fig. 1) is 
a tale of remarkable interactions across eight 
orders of magnitude, from turbulence (10°! m) 
to the size of the vast Pacific Ocean (10’ m). 
Local and basin-scale winds set up the tem- 
perature and current structures that create 
sub-metre turbulence, and the turbulent mix- 
ing enables subsurface changes to affect SST 
and the atmosphere. Climate models do not 
do very well in the tropical Pacific, producing a 
cold tongue that is too cold and extends too far 
westwards*. Ocean mixing is considered one of 
the culprits for this. Over time, moored y-pods 
will sample the oscillations between the El Nifio 
and La Nifia phases of ocean state, and reveal 
how large-scale ocean variations modulate tur- 
bulent mixing. Such observations are invaluable 
for validating and improving models. 

Deploying y-pods at other mooring sites 
across the equatorial Pacific is a natural next 
step. At the dateline, which marks the western 
terminus of the equatorial cold tongue, ocean- 
mixing observations can shed light on why the 
cold tongue in models extends too far west- 
wards. Moum and colleagues’ y-pod deploy- 
ment at 140° W is in an easterly wind regime 
with the upwelling centred on the Equator. By 
contrast, the eastern Pacific at 95° W is ina 
southerly wind regime, where the upwelling 
occurs south of the Equator (Fig. 1). A cross- 
equatorial array will reveal how ocean mixing 
responds to the southerly winds and displaces 
the cold tongue south of the Equator. There- 
fore, x-pods fill a crucial gap by enabling a 
close look at how ocean mixing and climate 
co-vary and interact. m 


Shang-Ping Xie is at the Scripps Institution of 
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From big data to 


mechanism 


The gene variant APOE4 is the most common genetic risk factor associated with 
late-onset Alzheimer’s disease. A comprehensive, multilayer study reveals the 
molecular and cellular signatures of APOE4 in humans. SEE ARTICLE P.45 


VIVEK SWARUP & DANIEL H. GESCHWIND 


Izheimer’s disease is a genetically com- 
A neurodegenerative condition typi- 
cally associated with ageing. Reporting 
in this issue, Rhinn et al.’ (page 45) use an inte- 
grative genomic approach based on analysis 
of transcriptional networks in human brain to 
uncover new relationships between this disor- 
der, ageing, and the gene apolipoprotein E, or 
APOE, shedding light on molecular mecha- 
nisms that connect this gene to the processing 
and transport of amyloid precursor protein — 
considered to bea primary effector of the dis- 
ease. To validate their hypotheses, the authors 
follow bioinformatic analyses with multiple 
levels of experimentation, impressively dem- 
onstrating the power of network biology*. 
Recent genome-wide association stud- 
ies (GWAS) and transcriptomic studies have 
implicated** a wide set of pathophysiological 
mechanisms in Alzheimer’s disease, includ- 
ing innate immunity, cellular uptake of mol- 
ecules by the process of endocytosis and their 


*This article and the paper under discussion’ were 
published online on 24 July 2013. 
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intracellular trafficking. However, a variant of 
APOE called APOF4 remains the most com- 
mon genetic risk factor, with the largest effect 
in late-onset Alzheimer’s disease (LOAD)*. 
Therefore, understanding how APOE4 con- 
veys disease risk and its relationship with 
ageing remains a research imperative. 

Rhinn et al. start with a simple, but previ- 
ously unexplored, set of comparisons using 
published data on gene expression in the 
cerebral cortex region of the human brain to 
address a question widely debated in the field: 
do the molecular mechanisms underlying 
LOAD differ between those who carry APOE4 
and those who do not? 

To discern the independent effects of disease 
and APOE status, the authors compare APOE4- 
negative healthy individuals with both APOE4- 
negative LOAD patients and APOE4-positive 
healthy individuals. Remarkably, they find that 
the total cellular transcript (transcriptome) pro- 
file of healthy APOE4 carriers overlaps to a large 
extent with that of LOAD patients, and that the 
overlapping set of transcripts (designated A/E) 
most closely resembles the changes in transcript 
profile observed during the transition from 


normal to incipient LOAD — an early phase 
of Alzheimer’s disease. These data indicate that 
the APOF4 status of an individual reflects a pro- 
dromal, or ‘primed; state for developing LOAD. 

Notably, transcriptional changes observed 
during normal ageing (through comparison of 
those under 75 years of age with those over 85) 
overlapped significantly with LOAD, but not 
with changes specific to APOE4 carriers. That 
certain transcriptional alterations in LOAD are 
seen either with ageing or with APOF4 status 
suggests that the contributions of APOE4 to 
LOAD are mechanistically distinct from those 
of ageing. A potential caveat of this analysis, 
however, is the use of a hard categorical cut- 
off of age 75, rather than, for example, 65 — 
the age below which Alzheimer’s disease 
is considered early onset. In this regard, a 
non-categorical quantitative analysis of tran- 
scripts associated with ageing should provide 
maximum analytical power. 

Network-level co-expression analyses have 
been used successfully before to investigate 
disease-related molecular pathways in vari- 
ous neuropsychiatric”® and neurodegenera- 
tive disorders, including Alzheimer’s disease*”’; 
such analyses permit an unbiased, systems- 
level understanding of disease mechanism. 
Rhinn and colleagues’ work differs from other 
network-based co-expression approaches 
in focusing on differential co-expression — 
that is, on changes in co-expression patterns 
observed between control and disease samples. 
The authors, who used a transcriptome-wide 
network-analysis tool called differential co- 
expression correlation analysis (DCA), provide 
extensive experimental support for the targets 
identified, which adds an unprecedented level 
of validation. 

Of the 215 differentially expressed genes 
Rhinn et al. found by comparing APOE4 car- 
riers and non-carriers, 20 are top candidate 
‘network node genes on the basis of their DCA 
ranking. Many of the node genes had been 
previously implicated in regulating amyloid 
precursor protein (APP) processing and intra- 
cellular sorting®, supporting their relevance. 

The authors performed various in vitro 
experiments to validate several of the node 
genes, in particular SV2A and RNF219. In cell 
lines and in human induced neurons, obtained 
by the process of cellular reprogramming, 
they show that SV2A affects APP processing 
through the regulation of endocytosis and 
intracellular trafficking. They also find that 
RNE219, which had not been previously asso- 
ciated with APP processing or Alzheimer’s 
disease, acts as a nuclear regulatory mediator 
of APOE4-induced APP endocytosis and its 
processing by the B-secretase enzyme. 

The researchers muster and analyse multi- 
ple independent data sets, including published 
GWAS, which provide further support for the 
key role of the DCA hits in APOE4-mediated 
susceptibility to Alzheimer’s disease, and make 
acase for further testing and replication studies 


using this gene set. For example, they show 
that, together with the APOE4 status, common 
genetic variants of RNF219 and FYN — another 
node gene with a role in endocytosis — are cor- 
related with age of LOAD onset. Indeed, using 
publicly available human positron-emission 
tomography imaging data from the Alzheimer’s 
Disease Neuroimaging Initiative, Rhinn et al. 
find that variations in RNF219 significantly 
affect APP processing in APOE4-negative 
healthy individuals — consistent with a prodro- 
mal role of this, and possibly other, A/E genes. 

Overall, the present paper significantly 
advances our knowledge of molecular and 
genetic mechanisms that modify LOAD risk, 
further implicating defects in endocytosis and 
intracellular trafficking in Alzheimer’s disease 
development”. The identification of key 
regulatory molecules and pathways involved 
in APOE-based risk for LOAD sets the stage 
for investigating their link to other deficits in 
this disorder, including immune function and 
metabolism of hyperphosphorylated tau pro- 
tein, which forms the disease’s characteristic 
neurofibrillary tangles. Ultimately, such inves- 
tigations should lead to molecularly targeted 
therapeutics. 

By demonstrating distinct mechanisms that 
depend on APOE status in LOAD, another 
implication of this work is that different genetic 
forms or subtypes of Alzheimer’s disease may 
have specific underlying mechanisms and 
respond differently to treatment. If so, as with 
cancer, therapies based on genetic and genomic 
diagnostics may be most effective. 

This work is also exemplary in demonstrat- 
ing the extraordinary value of publicly avail- 
able data resources. Published data on human 
gene expression, Alzheimer’s disease GWAS 
and neuroimaging provide the pillars of Rhinn 
and collaborators’ paper. Integrative analyses 
of these data by the authors, and previously 
by others*’, weaken the view that substantive 
biological experimentation only takes place 
at the wet bench, and highlight the value of 
innovative re-analyses of existing data®. m 
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50 Years Ago 


“The Delhi Pillar’ — The mystery 
which has enveloped this quite 
remarkable piece of metal should 
be once and for all removed as a 
result of three papers published 
inthe ... Journal of the National 
Metallurgical Laboratory of India ... 
All that has been previously written 
is summarized and the theories 
propounded to explain its resistance 
to corrosion are collected ... Made 
by hammer-welding small balls 

of native iron ... its resistance to 
corrosion during the 1,600 years or 
so of its existence is ascribed to the 
combined influence of a number 

of normal, favourable factors. 
Among these are included the 
climate and freedom from pollution 
of the atmosphere. Secondly, to a 
progressively decreasing rate of 
attack due to the building up ofa 
protective layer of oxide and scale 
during the early years of exposure. 
That in the past on ceremonial 
occasions the pillar was anointed 
appears to be probable ... This is 
supported by the X-ray analysis 

ofa minute sample of the surface 
coating ... which contained no iron 
and only quartz and chalk providing 
another source of protection. 

From Nature 3 August 1963 


100 Years Ago 


We have applied the new methods 
of investigation involving the use of 
X-rays to the case of the diamond, 
and have arrived ata result which 
seems of considerable interest. The 
structure is extremely simple. Every 
carbon atom has four neighbours 

at equal distances from it, and in 
directions symmetrically related 

to each other. The directions are 
perpendicular to the four cleavage 
or (111) planes of the diamond; 
parallel, therefore, to the four lines 
which join the centre ofa given 
regular tetrahedron to the four 
corners. W.H. Bragg & W. L. Bragg 

From Nature 31 July 1913 
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Spot the gorilla 


If it’s not relevant you may miss it. This 
phenomenon of inattentional blindness is 
well documented; in a classic study, most 
observers asked to monitor a video of a ball 
game missed a gorilla on the court. But would 
experts also miss a gorilla? Yes, according to a 
study on the subject (T. Drew et al. Psychol. Sci. 
http://doi.org/m9s; 2013). 

The authors asked 24 radiologists to 
search for lung nodules (small, bright 
circles) in several chest scans; the final scan 
(pictured) contained the image of a gorilla, 
48 times larger than an average nodule. 
Surprisingly, and perhaps worryingly, 83% of 
the experts missed the gorilla. It is somewhat 
reassuring that the miss rate among non- 
medical observers performing the same task 
was even higher. Sadaf Shadan 


NANOTECHNOLOGY 


~~ 


Tiny thermometers 
used in living cells 


Nanometre-scale thermometers that operate with millikelvin sensitivity have 
now been made from diamond crystals. The devices have been used to measure 
temperature gradients in living cells. SEE LETTER P.54 


KONSTANTIN SOKOLOV 


espite many promising studies, 
D taking temperature measurements 

of environments at nanometre-scale 
resolution remains a formidable challenge. On 
page 54 of this issue, Kucsko et al.’ report a pre- 
cious solution to this problem: a thermometer 
based on diamond nanocrystals, also known 
as nanodiamonds. This sensing tool could 
have many applications, ranging from studies 
of cell biology to measurements of nanoscale 
chemical reactions. 

Temperature affects diverse physical 
phenomena. For example, changes in Earth’s 
temperature patterns can lead to the forma- 
tion of severe storms, droughts and floods. 
Temperature governs the kinetics, activation 
and equilibrium states of chemical reactions. 
And in humans, body temperature is precisely 
controlled, so that any deviation from the nor- 
mal range triggers a cascade of biomolecular 
mechanisms to restore the body’s equilibrium. 
Scientists have therefore developed a range of 
precise temperature-measuring tools — from 
satellites to infrared cameras, and a variety of 
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more familiar thermometers — to measure 
temperature over length scales from multiple 
kilometres to submillimetres. But how can we 
measure temperature at length scales of a few 
micrometres, or a few tens of nanometres? 
Kucsko and co-workers’ approach is to use 
the unique properties of electron spins associ- 
ated with single-nitrogen-atom impurities in 
diamonds. The presence ofa nitrogen atom in 
a diamond's carbon-atom lattice creates a point 
defect called a nitrogen vacancy (NV) centre, 
in which the nitrogen and a vacancy replace 
two neighbouring carbons. The ground state of 
an NV centre is split into two energy levels: the 
spin state of the lower level is 0, whereas that 
of the higher level is 1. The energy difference 
between the levels, known as the ground-state 
energy gap, is highly sensitive to temperature 
because it varies in response to thermally 
induced lattice strains. The principle of dia- 
mond thermometry is based on accurate meas- 
urement of changes in the transition frequency 
associated with this energy gap — the micro- 
wave frequency that corresponds to the energy 
difference between the lower and higher levels. 
In their technique, Kucsko and colleagues 
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used green light to excite electrons in NV 
centres, which then decayed to the ground 
state by emitting red fluorescence. The inten- 
sity of the fluorescence depends on the spin 
state of the NV centres. The authors also 
irradiated their nanodiamonds with micro- 
waves to modulate the electron occupancy of 
the ground spin states 0 and 1, and determined 
the occupancy of the states from the observed 
fluorescence. They then used this information 
to work out the changes in the ground-state 
energy gap that are associated with tempera- 
ture variations. 

The researchers first used an isotopically 
pure (carbon-12 isotope) bulk diamond sam- 
ple to determine the ultimate sensitivity of 
their NV-based thermometry. In this system, 
they detected temperature changes with an 
accuracy of up to 1.8 millikelvins under ideal 
experimental conditions. Similar sensitiv- 
ity has just been reported by other groups”? 
using analogous experimental techniques and 
conditions. 

However, Kucsko et al. went further by 
demonstrating how nanodiamond thermo- 
meters can measure the temperatures in living 
cells (Fig. 1). They used a clever nanowire- 
assisted delivery method’ to position nano- 
diamonds and gold nanoparticles inside the 
cells. When excited by laser light, the gold 
nanoparticles acted as localized heat sources. 
By using their technique to measure sub-kelvin 
temperature changes inside a single cell, the 
authors directly monitored the amount of heat 
generated by a single gold nanoparticle that 
was required to kill the cell. 

Kucsko and colleagues’ nanodiamond tem- 
perature sensors have high spatial resolution 
together with sub-kelvin thermal sensitivity, 
chemical inertness, biocompatibility and the 
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Figure 1 | Temperature measurements in living cells. When diamond-lattice defects known as nitrogen 
vacancy (NV) centres are excited by green light, they emit red fluorescence. Kucsko et al.' inserted 
nanodiamonds into single living cells, and irradiated them with microwaves to modulate the electron 
occupancy of spin states in NV centres, and with green light. By measuring the fluorescence from these 
centres, the authors established the electron occupancy of the spin states, and so determined changes in the 
ground-state energy gap (the microwave frequency that corresponds to the energy difference between spin 
states) that are associated with temperature variations. In this way, they measured the temperature gradient 
generated when a gold nanoparticle in the cell was heated by a laser beam, achieving sub-kelvin sensitivity. 


best-known thermal conductivity of all solid 
materials. This is an ideal blend for a nano- 
thermometer. Furthermore, temperature sens- 
ing with nanodiamonds could be extended to 
in vivo applications if a different method for 
fluorescence excitation were adopted: micro- 
wave excitation of electronic spin states has 
already been carried out in animals’, and the 
use of ‘two-photor excitation would allow 
the analysis of deeper tissue than could be 
achieved with the present method. 

Because nanodiamonds are discrete objects, 
however, the authors’ method can take meas- 
urements only at distinct locations, rather 
than taking continuous measurements of a 
temperature field. Furthermore, the method 
monitors temperature variations rather than 
absolute temperature. The authors suggest that 
this limitation could be overcome by using 
ensembles of nanodiamonds or diamond 
samples in which the lattices have low strain, 
either of which would reduce the experimental 
variations that currently limit absolute tem- 
perature from being measured. At present, the 
technique also has a fairly low temporal reso- 
lution of tens of seconds. This is sufficient for 
measurements of many biological processes, 
such as changes in gene expression, but is too 
slow for studies of temperature effects in faster 
processes, for example the initial steps of signal 
transduction, or neural activity. 

How might this new tool further our under- 
standing of human cell biology, or enable 


biomedical advances? Most human cells are 
10-20 micrometres in size and are highly 
compartmentalized by internal membranes 
that separate cellular organelles. These orga- 
nelles create multiple micrometre-sized 
reactors in which a plethora of energy-produc- 
ing and energy-absorbing reactions occur. The 
reactions generate intracellular temperature 
gradients on micrometre and submicrometre 
scales that, in turn, influence other cellular 
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biochemical reactions. Furthermore, exter- 
nal biochemical signalling and environmental 
changes activate molecular responses inside 
cells that can lead to corresponding changes in 
intracellular temperature gradients. The ability 
to measure intracellular temperature precisely 
would therefore provide an invaluable tool 
for cellular biophysicists, potentially allowing 
cellular behaviour and characteristics to be 
manipulated by controlling the temperature 
within, or close to, cellular organelles. 

Kucsko and co-workers’ technique could 
also open up many other intriguing topics for 
research, including the thermal modulation of 
immune responses’, molecular mechanisms 
of therapeutic tissue preservation induced by 
local cooling’, the role of subcellular tempera- 
ture gradients in cell function’, and cell resist- 
ance to hyperthermia treatment” (deliberately 
induced elevated body temperature, used, 
for example, as anticancer therapy). When it 
comes to measuring temperature, it may be 
that diamonds are a scientist’s best friend. = 
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Angiogenic awakening 


Metastatic tumour cells often remain dormant for years. New findings suggest 
that endothelial cells lining blood vessels have a central role in regulating the 
transition from dormancy to metastatic growth. 


NETA EREZ 


he primary cause of cancer-associated 
deaths is the metastasis of a tumour to 
distant organs. Advanced metastatic 
cancers are mostly incurable, and available 
therapies can only prolong life to a limited 
extent. In many tumour types there is a long 
lag between the arrival of cancerous cells at 
distant locations and their colonization of 
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an organ, such that the formation of clini- 
cally evident metastases can take months to 
decades. This prolonged dormancy suggests 
that disseminated tumour cells must over- 
come growth-inhibiting signals from their 
new local environment in order to take over 
the metastatic organ’. The role of the micro- 
environment in supporting tumour growth 
at the primary site is well documented’, but 
the role of the metastatic microenvironment 
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Figure 1 | Temperature measurements in living cells. When diamond-lattice defects known as nitrogen 
vacancy (NV) centres are excited by green light, they emit red fluorescence. Kucsko et al.' inserted 
nanodiamonds into single living cells, and irradiated them with microwaves to modulate the electron 
occupancy of spin states in NV centres, and with green light. By measuring the fluorescence from these 
centres, the authors established the electron occupancy of the spin states, and so determined changes in the 
ground-state energy gap (the microwave frequency that corresponds to the energy difference between spin 
states) that are associated with temperature variations. In this way, they measured the temperature gradient 
generated when a gold nanoparticle in the cell was heated by a laser beam, achieving sub-kelvin sensitivity. 


best-known thermal conductivity of all solid 
materials. This is an ideal blend for a nano- 
thermometer. Furthermore, temperature sens- 
ing with nanodiamonds could be extended to 
in vivo applications if a different method for 
fluorescence excitation were adopted: micro- 
wave excitation of electronic spin states has 
already been carried out in animals’, and the 
use of ‘two-photor excitation would allow 
the analysis of deeper tissue than could be 
achieved with the present method. 

Because nanodiamonds are discrete objects, 
however, the authors’ method can take meas- 
urements only at distinct locations, rather 
than taking continuous measurements of a 
temperature field. Furthermore, the method 
monitors temperature variations rather than 
absolute temperature. The authors suggest that 
this limitation could be overcome by using 
ensembles of nanodiamonds or diamond 
samples in which the lattices have low strain, 
either of which would reduce the experimental 
variations that currently limit absolute tem- 
perature from being measured. At present, the 
technique also has a fairly low temporal reso- 
lution of tens of seconds. This is sufficient for 
measurements of many biological processes, 
such as changes in gene expression, but is too 
slow for studies of temperature effects in faster 
processes, for example the initial steps of signal 
transduction, or neural activity. 

How might this new tool further our under- 
standing of human cell biology, or enable 


biomedical advances? Most human cells are 
10-20 micrometres in size and are highly 
compartmentalized by internal membranes 
that separate cellular organelles. These orga- 
nelles create multiple micrometre-sized 
reactors in which a plethora of energy-produc- 
ing and energy-absorbing reactions occur. The 
reactions generate intracellular temperature 
gradients on micrometre and submicrometre 
scales that, in turn, influence other cellular 
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biochemical reactions. Furthermore, exter- 
nal biochemical signalling and environmental 
changes activate molecular responses inside 
cells that can lead to corresponding changes in 
intracellular temperature gradients. The ability 
to measure intracellular temperature precisely 
would therefore provide an invaluable tool 
for cellular biophysicists, potentially allowing 
cellular behaviour and characteristics to be 
manipulated by controlling the temperature 
within, or close to, cellular organelles. 

Kucsko and co-workers’ technique could 
also open up many other intriguing topics for 
research, including the thermal modulation of 
immune responses’, molecular mechanisms 
of therapeutic tissue preservation induced by 
local cooling’, the role of subcellular tempera- 
ture gradients in cell function’, and cell resist- 
ance to hyperthermia treatment” (deliberately 
induced elevated body temperature, used, 
for example, as anticancer therapy). When it 
comes to measuring temperature, it may be 
that diamonds are a scientist’s best friend. = 
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Angiogenic awakening 


Metastatic tumour cells often remain dormant for years. New findings suggest 
that endothelial cells lining blood vessels have a central role in regulating the 
transition from dormancy to metastatic growth. 


NETA EREZ 


he primary cause of cancer-associated 
deaths is the metastasis of a tumour to 
distant organs. Advanced metastatic 
cancers are mostly incurable, and available 
therapies can only prolong life to a limited 
extent. In many tumour types there is a long 
lag between the arrival of cancerous cells at 
distant locations and their colonization of 
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an organ, such that the formation of clini- 
cally evident metastases can take months to 
decades. This prolonged dormancy suggests 
that disseminated tumour cells must over- 
come growth-inhibiting signals from their 
new local environment in order to take over 
the metastatic organ’. The role of the micro- 
environment in supporting tumour growth 
at the primary site is well documented’, but 
the role of the metastatic microenvironment 
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Figure 1 | Instigation of angiogenesis enables tumour-cell exit from 
dormancy. Ghajar et al.* show that expression of the protein TSP-1 
by endothelial cells in stable blood vessels inhibits the proliferation of 
disseminated tumour cells (DTCs) residing on the blood vessels. 

By contrast, DTC proliferation is accelerated at the sprouting 

tips of new vessels, in part by endothelial-cell secretion of periostin, 


and the molecular interactions between dis- 
seminated cancer cells and stromal cells at 
the metastatic organ are poorly character- 
ized. Writing in Nature Cell Biology, Gha- 
jar et al.* report a mechanism by which the 
endothelial cells that line blood vessels regu- 
late the dormancy of disseminated breast 
cancer cells*. 

Using a mouse model of human breast- 
cancer metastasis, the authors show that 
dormant disseminated tumour cells (DTCs) 
reside on the endothelium of the microvas- 
culature in the lung, bone marrow and brain, 
which are common metastatic destinations of 
breast cancer. An in vitro three-dimensional 
model of microvasculature confirmed that 
this perivascular location of tumour cells is 
responsible for maintaining their quiescent 
state, and identified the protein thrombospon- 
din-1 (TSP-1), secreted by endothelial cells, as 
a suppressor of tumour-cell growth. 

Remarkably, this growth-suppressive micro- 
environment was found only around stable 
microvascular endothelium; the sprouting tips 
of newly forming vessels actually had an oppo- 
site, growth-accelerating effect on tumour 
cells. Unlike stable vessels, these growing tips 
were characterized by enhanced expression of 
the proteins periostin, tenascin-C, fibronec- 
tin and active tumour growth factor-B1 
(TGF-B1), which have all previously been 
implicated in the formation of the metastatic 
niche*’ (Fig. 1). 

These findings suggest that the vascular 
endothelium is an active participant in the 
formation of a growth-permissive microen- 
vironment. The growth-accelerating effect 
could be seen in vitro: reducing the number 
of endothelial tips by targeting Notch], a pro- 
tein that regulates vessel sprouting, suppressed 
the proliferation of breast-tumour cells in the 
microvasculature models. And when the 


*This News & Views article was published online 
on 24 July 2013. 
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authors injected the tumour cells into zebrafish 
that were genetically engineered to have exces- 
sive microvascular sprouting, they observed 
enhanced proliferation of the cells adjacent to 
new blood-vessel tips. 

Angiogenesis — the growth of new blood 
vessels — is crucial to the ability of tumours 
and metastases to thrive*”. Ghajar and col- 
leagues’ findings adda fascinating layer to this 
story by suggesting that metastatic propaga- 
tion may rely on the induction of new vascular 
growth not only because such growth supports 
nutrient and oxygen supply to micrometasta- 
ses, but also because it enables the exit of DTCs 
from dormancy. 

But can solitary DTCs exhibit such pro- 
angiogenic signalling? Dormant tumours are 
metabolically active and sustain a balance 
between angiogenic inhibitors and stimula- 
tors, remaining dormant until this balance is 
shifted towards the stimulation of angiogen- 
esis*’. However, it is not clear whether single 
dormant DTCs can induce angiogenesis. Most 
previous studies on angiogenesis and tumour 
dormancy have focused on how tumour-cell 
characteristics determine the nature of the 
tumour vasculature. But the present findings 
imply that, in some cases, the order of events 
may be reversed, such that the awakening of 
the vasculature precedes metastatic growth 
and is independent of pro-angiogenic signal- 
ling from dormant DTCs. 

In healthy adult tissues, the formation of 
blood vessels is a relatively infrequent event 
that is only triggered when certain signals lead 
to an increase in angiogenic activators — for 
example, following tissue-damaging events 
such as oxidative stress, inflammation or 
wound healing. If such events occurred in 
an organ in which DTCs reside, the result- 
ing instigation of new vascular growth might 
inadvertently switch the balance from growth- 
suppressive to growth-accelerating signals 
and thereby facilitate the exit of DTCs from 
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Metastatic tumour 


tenascin-C, fibronectin and TGF-f1. Instigation of angiogenesis (the 
formation of new blood vessels), for example following oxidative stress, 
inflammation or wound healing, at a site to which tumour cells have 
metastasized but are dormant, might thus awaken these cells and enable 
the formation of metastatic tumours. Proliferating tumour cells will express 
pro-angiogenic signals that feed back to amplify this process. 


dormancy. Thus, escape from dormancy may 
be, in some cases, secondary to the induction 
of angiogenesis at the metastatic site (Fig. 1). 

The ability of tumour cells to exit dormancy 
and proliferate seems to be the rate-limiting 
step of metastasis. Ghajar and colleagues’ 
results emphasize that to confront metastasis in 
a particular organ, the interactions of tumour 
cells with the microenvironment of that 
specific organ need to be elucidated. 

Numerous studies have previously demon- 
strated the central role of innate and adaptive 
immune cells in the formation ofa permissive 
metastatic environment”, and it is likely that 
immune cells and other cells at the metastatic 
site function together with the signals from 
endothelial cells to regulate tumour-cell dor- 
mancy. Ghajar et al. used immunodeficient 
mice in their experiments, but future studies 
in animal models with intact immunity may 
shed light on such interactions. Expanding our 
understanding of the early stages of metastatic 
growth will pave the way to new targeted thera- 
peutics aimed at preventing, rather than trying 
to cure, metastatic disease. m 
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Myc-driven endogenous cell competition 
in the early mammalian embryo 


Cristina Claveria', Giovanna Giovinazzo!, Rocio Sierra! & Miguel Torres! 


The epiblast is the mammalian embryonic tissue that contains the pluripotent stem cells that generate the whole embryo. 
We have established a method for inducing functional genetic mosaics in the mouse. Using this system, here we show 
that induction of a mosaic imbalance of Myc expression in the epiblast provokes the expansion of cells with higher Myc 
levels through the apoptotic elimination of cells with lower levels, without disrupting development. In contrast, 
homogeneous shifts in Myc levels did not affect epiblast cell viability, indicating that the observed competition results 
from comparison of relative Myc levels between epiblast cells. During normal development we found that Myc levels are 
intrinsically heterogeneous among epiblast cells, and that endogenous cell competition refines the epiblast cell 
population through the elimination of cells with low relative Myc levels. These results show that natural cell 
competition in the early mammalian embryo contributes to the selection of the epiblast cell pool. 


Multicellular organisms have evolved tissue homeostasis mechanisms 
to ensure life-long fitness of their organs and systems. Among these 
mechanisms, those that regulate the elimination of unwanted cells are 
fundamental for tissue development and homeostasis’. Whereas overtly 
damaged or hyperproliferative cells usually trigger well-characterized 
cell-autonomous apoptotic pathways’, mechanisms that survey viable 
cell fitness to optimize a tissue’s cell composition are less well under- 
stood. A candidate mechanism is the phenomenon of cell competition, 
first described in Drosophila’. Studies in flies have shown that the cells of 
growing organs are able to compare their fitness with that of neighbour- 
ing cells, and the less-fit but otherwise viable cells are eliminated (out- 
competed) when confronted with a fitter cell population (competitor)**. 
Cell competition for survival is an active process because, in addition to a 
contribution from differential proliferation’, it is executed through the 
apoptotic elimination of the less-fit population by cell non-autonomous 
mechanisms”. Cell parameters that trigger competition in flies include 
differences in protein synthesis capacity, growth factor receptivity and the 
expression level of dMyc*°"°, a major activator of cell anabolism. Super- 
competition is a variant of cell competition in which cells moderately 
overexpressing dMyc outcompete wild-type (WT) cells*. The replace- 
ment of cell populations through cell competition is phenotypically 
silent, because the competitor cells mostly conform to size-control 
mechanisms**. This process is potentially important for the long-term 
maintenance of tissue performance, as it might provide a mechanism for 
elimination of suboptimal cells from stem-cell niches and progenitor- 
cell pools (reviewed in refs 11, 12). In addition, cell competition for sur- 
vival might serve to enhance tissue replacement during regeneration”. 
However, despite its potential importance, cell competition for survival 
has so far only been observed after experimental induction* *'*"°, and 
its endogenous role has not been uncovered. Here we use a genetic 
mosaic approach to induce supercompetition in the mouse embryo 
conditionally, and identify endogenous cell competition in the epiblast 
as a mechanism for the selection of cells with higher anabolic activity 
(Supplementary Fig. 1). 


Myc-enriched cell expansion in embryos 


To develop a methodology for analysing cell competition in mam- 
mals, we devised a recombinase-based system for producing inducible 


random genetic mosaics (iMOS) (Supplementary Information). This 
system allows the conditional generation of random mosaics expressing 
two alternative transcripts, T1 or T2, from the ubiquitously expressed 
ROSA26 locus (Supplementary Fig. 2a). Once generated, the mosaic cell 
populations are stable (Supplementary Fig. 3) and can be independently 
tracked by the expression of IRES-EYFP inserted in T1 and IRES-ECFP 
in T2 (Supplementary Fig. 2a). Recombination of the iMOS™” allele 
in the epiblast using Sox2-cre (ref. 17) produced random mosaicism for 
the expression of the two reporter genes in all embryonic tissues (Sup- 
plementary Fig. 2b-i). Quantification by flow cytometry and confocal 
microscopy at embryonic day (E)9.5 showed a 1:3 ratio of enhanced 
cyan fluorescent protein (ECFP) to enhanced yellow fluorescent protein 
(EYFP) cells (Supplementary Fig. 2j, k). 

To test whether increasing the dose of Myc expands cell popula- 
tions in the mouse embryo, we generated an iMOS allele ((MOS™'™”) 
in which Myc was inserted before IRES-EYFP in T1 (Supplementary 
Fig. 4a). Sox2-cre recombination of the iMOS"™' allele produced 
mosaic Myc expression at a similar level to a wild-type allele (Sup- 
plementary Fig. 5 and Supplementary Information) and resulted in 
significant rescue of the Myc null phenotype at E10.5 (Supplementary 
Fig. 4b-h), indicating that Myc overexpression occurs within a physi- 
ological range. 

Rescued embryos had an elevated EYFP/ECFP expression ratio, 
suggesting an increase in the proportion of Myc-enriched cells (Sup- 
plementary Fig. 4i-n). We next induced iMOS" mosaics in three 
Myc backgrounds: wild type, Myc’’~ and Myc ‘~ (Fig. 1a). This strat- 
egy allowed Myc-enriched EYFP cells to be confronted with Myc-non- 
enriched ECFP cells at increasing ratios of relative Myc content (Fig. 1a). 
Compared with the iMOS™” mosaics, E10.5 iMOS'”* mosaics 
showed a shift towards Myc-linked EYFP expression (Fig. 1b and Sup- 
plementary Fig. 40). Moreover, increasing Myc dose imbalance between 
the two cell populations correlated with an increasing EYFP/ECFP 
signal ratio (Fig. 1b and Supplementary Fig. 40). Quantification by 
confocal microscopy and flow cytometry at E10.5 showed that the 
proportion of Myc-non-enriched ECFP cells decreased in correlation 
with the imbalance in Myc dose (Fig. 1c), but that the changes in Myc 
dose did not affect cell size or embryo size (Supplementary Fig. 6). The 
shift in cell population proportions thus results from the replacement of 
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Figure 1 | Myc overexpression promotes dose-dependent expansion of cell 
populations in mosaic embryos. a, Myc gene doses in Sox2-cre-recombined 
iMOS™* and iMOS™™”* embryos in different endogenous Myc backgrounds. 
Myc dose indicates the numbers of Myc copies in EYFP and ECFP cells. 

b, Overlay views of EYFP and ECFP signals in E10.5 mosaics. Scale bars, 1 mm. 
c, d, ECFP cell relative percentages in confocal sections. Graphs show 

means + s.d. of cell percentages measured in individual embryos collected from 
various similar litters (biological replicates). n = 8,7, 6, 11, 9, 5, 10, 6 (from left 
to right) embryos in c and n = 8, 6, 7, 12, 7, 11 embryos in d; **P < 0.01; 
***P <0).001 by ANOVA followed by Tukey honestly significant difference 
(HSD) tests. 


Myc-non-enriched cells by Myc-enriched cells. Notably, cells with wild- 
type levels of Myc are depleted when confronted with cells with higher 
Myc expression, but cells with absolute Myc levels similar to wild type 
(1 wild-type Myc + 1 iMOS™™ alleles) or lower (1 iMOS™’™ allele) 
expand when confronted with cells with lower relative Myc levels, indi- 
cating that the key to this phenomenon is the Myc dose difference 
between cells. iMOS”* and iMOS'’”* mosaics in a wild-type back- 
ground both survive to adulthood and display no evident phenotypic 
alterations (n > 150 and n> 180, respectively). Enrichment in Myc- 
overexpressing cells thus results from a cell population replacement 
process that is phenotypically silent. 

To track the depletion of wild-type cells in iMOS mosaics, we 
measured the proportion of ECFP cells at different stages. At E6.5, 
proportions did not differ significantly between iMOS™”’ and 
iMOS''™”* mosaics (Fig. 1d); but at E9, the frequency of wild-type 
cells was reduced by a half, and then remained unchanged at E10.5 
(Fig. 1d). Wild-type cell depletion is thus not continuous within this 
period, but takes place in a short time window early in embryogenesis. 
The frequency of Myc-non-enriched cells in iMOS'’™* mosaics 
induced in Myc*/~ and Myc ‘~ backgrounds already showed a tend- 
ency to reduce at E6.5 (Fig. 1c), indicating that the cell population 
replacement is active at early embryonic stages. 


T1-Myc 


Myc-induced expansion requires apoptosis 

To explore how embryos become enriched in Myc-overexpressing 
cells, we first studied cell proliferation. ECFP and EYFP cell popula- 
tions in iMOS''™” mosaics showed no significant differences in 
pH3* cell frequency at E6.5 or in 5-bromodeoxyuridine (BrdU) * cell 
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frequency at E6.5, E7.5 and E9.5 (Supplementary Fig. 7 and Sup- 
plementary Information). 

To evaluate the contribution of cell death to the depletion of wild-type 
cells in iMOS"'™””” mosaics, we generated a new iMOS allele 
(imos™'™”“7?P>°) incorporating Myc and p35 in T1 and T2 (Fig. 2a). 
In these mosaics, the Myc-overexpressing cell population is con- 
fronted with a Myc-non-enriched cell population expressing the apop- 
tosis inhibitor p35 (Fig. 2a). p35 expression in iMOS'??* mosaics 
inhibits apoptosis in the mouse embryo (Supplementary Fig. 8 and 
Supplementary Information). In iMOS™ 7? mosaics, the higher 
proportion of EYFP signal seen in iMOS''™”” embryos at E9.5 was 
abolished (Fig. 2 b-d). This was also evident on confocal sections and by 
flow cytometry (Fig. 2e-h). Quantitative confocal analysis showed that 
both the depletion of wild-type ECFP cells in iMOS"’™”* mosaics and 
their rescue in iMOS"'"’?P*> mosaics were homogeneous in differ- 
ent tissues (Fig. 2i). iMOSTE™/T2-P35 mosaics survive to adulthood, 
and display no obvious phenotypic alteration (m > 50). The inhibition 
of apoptosis in the wild-type cell population thus prevents its depletion 
and the concomitant expansion of the Myc-enriched cell population. 


Cell competition occurs in the epiblast 

The fact that Myc-overexpressing cells do not increase between E9 
and E10.5, together with the homogeneity of wild-type cell depletion 
and p35 rescue in different embryonic tissues, suggests that wild-type 
cell depletion occurs earlier, in a homogeneous precursor population. 
We therefore analysed the epiblast, which contains all embryonic pre- 
cursors. In agreement with previous studies"’, the incidence of apop- 
tosis in iMOS™” mosaics was high, and was restricted to the epiblast, 
without affecting extraembryonic regions (Fig. 2j). Mosaic Myc 
expression sharply increased cell death in the epiblast (Fig. 2k, p). 
Expression of p35 in the complementary cell population completely 
rescued the increased cell death induced by mosaic Myc overexpres- 
sion (Fig. 21, p). 

Scoring of TdT-mediated dUTP nick end labelling (TUNEL) staining 
in EYFP and ECFP populations (Fig. 2m-o, q) showed that the increased 
cell death in iMOS"’”” embryos mostly affects the wild-type ECFP 
population, with only a minor contribution from Myc-overexpressing 
EYFP cells (Fig. 2n, q). The increased incidence of TUNEL-positive 
ECEP cells was abolished in iMOST 77? mosaics (Fig. 20, q). 
Moreover, iMOST1/1?-P35 embryos lacked the moderate increase in 
cell death detected in the Myc-overexpressing EYFP cell population in 
iMOS™!™”* mosaics (Fig. 20, q). 

We next quantified apoptosis in embryos homogenously overex- 
pressing Myc and in embryos with reduced (Myc*'~) or absent 
(Myc ‘~) Myc expression. In contrast with the results obtained with 
the mosaics, apoptosis was not modified in any of these conditions 
(Supplementary Fig. 9). These results exclude cell-autonomous induc- 
tion of apoptosis by Myc overexpression or reduction. Wild-type cell 
depletion in iMOS™'* mosaics is thus driven by a cell competition 
process implemented through non-autonomous induction of cell 
death in the mouse epiblast. 


Cell competition death mechanisms 

Marker analysis in iMOS™'™” mosaics revealed activation of the 
intrinsic but not the extrinsic apoptotic pathway in epiblast cell death 
(Supplementary Fig. 10 and Supplementary Information). To dissect the 
mechanism of cell-competition-induced apoptosis further, we deve- 
loped an in vitro drug-inducible iMOS model of Myc supercompetition 
in embryonic stem (ES) cells. Myc drives cell competition in this model 
in a manner similar to that observed in the mouse epiblast (Sup- 
plementary Information and Supplementary Fig. 11). To characterize 
the interactions required for cell competition, we induced recombina- 
tion in homozygous iMOS"! ES cell cultures and scored apoptosis in 
wild-type cells residing in unrecombined colonies, wild-type cells in 
recombined colonies but not contacting EYFP cells, and wild-type cells 
contacting EYFP cells (Supplementary Fig. 12a and Fig. 3a—c). Increased 


©2013 Macmillan Publishers Limited. All rights reserved 


> 


iMO. ST -Myc iMOS T1-Myc/T2-p35 


co imos“7 


tekken tok eek 
—r ri 


f imMos™ Myc 
BB {MOSTI-Myc/T2-p35 


tekken 


a T 7 


a 
ROSA26 LD PGK-neo 
Mosaics » 
oa 
™ 
oO 
Ww 
+ 
a 
i 
> 
Ww 

h 

a) 
2 
Ee 
5 
fe} 
1S) 
ed 
Wu 
z 
=) 
i 
+ 
[oi 
> 
Ww 
p 


Somites Head mes. Neural tube 


iMos™ -Myc iMO. su -Myc/T2-p35 


ARTICLE 


apoptosis with respect to uninduced cultures was observed only in 
wild-type cells in direct contact with Myc-overexpressing EYFP cells 
(Fig. 3d). A similar result was obtained for EYFP-cell-contacting and 
non-contacting wild-type cells in the iMOS''™” embryonic epiblast 
(Fig. 3e, fand Supplementary Video 1). In addition, exposure of unin- 
duced ES cell cultures to cell-competition-conditioned medium did 
not increase apoptosis or modify proliferation rates (Supplementary 
Fig. 12b-e). These results indicate that cell competition requires direct 
contact or very short-range interactions and that, under these experi- 
mental conditions, diffusible signals are not sufficient to induce it. 
We next studied the mechanism by which apoptotic cells are elimi- 
nated in mammalian cell competition. Previous studies showed that 
dying cells in the mouse epiblast are removed through phagocytosis 
by neighbouring cells'’. We found that cells showing apoptosis mar- 
kers are engulfed by neighbouring cells in epiblasts and ES cell cultures 
undergoing cell competition (Supplementary Fig. 13a, c, d, f). Some 
phagocytic events take place in the absence of TUNEL staining, sug- 
gesting that engulfment by neighbours is an early event in epiblast and 
ES cell apoptosis (Supplementary Fig. 13b, e). These results indicate 
that mammalian cell competition involves phagocytosis by neighbour- 
ing cells, at least in situations in which there are no macrophages. 


Myc-driven endogenous competition 
The high cell death rate in the epiblast and its sensitivity to induced 
supercompetition suggest that it might be a site of endogenous cell 
competition. We thus studied the distribution of endogenous Myc in 
the early mouse embryo. Up to E6.75, Myc mRNA and protein are 
expressed in the epiblast and extraembryonic tissues at levels that vary 
from cell to cell in an apparently random distribution (Supplementary 
Figs 14-16), contrasting with the homogenous expression of the epi- 
blast marker Oct4 (Supplementary Fig. 17). By E7.0 embryonic Myc 
expression ceases in the epiblast, restarting at E7.5 in a homogeneous 
distribution (Supplementary Figs 14 and 15). Embryonic Myc expres- 
sion is thus heterogeneous until E6.75, in a pattern that is not region- 
ally modulated but is instead intrinsic to individual cells. Myc mRNA 
and protein have a fast turnover and Myc protein abundance is pre- 
dominantly regulated by mRNA availability”, indicating that the vari- 
able protein levels result from the variable mRNA levels. To confirm 
this, we performed per-cell quantification of Myc immunofluorescence 
(Methods and Supplementary Fig. 18) to determine the distribution of 
Myc levels in wild-type embryos and in iMOS™™™”; Myc”'~ E6.5 
embryos, in which Myc mRNA is exclusively produced from the 
ROSA26 allele. Myc protein levels in iMOS™'™; Myc’/~ mosaics 
were more homogeneous than in wild-type embryos (2.2-fold higher 
residual variance in wild type; P< 0.001) (Fig. 4a—h). The heterogen- 
eity of endogenous Myc protein expression thus seems to result mostly 
from variable transcriptional activity of the Myc gene. 

Further analysis indicated that the variability in Myc expression levels 
cannot be attributed to variations during the cell cycle (Supplementary 


Figure 2 | Expansion of Myc-overexpressing cells requires apoptosis of 
wild-type cells in the epiblast of early mosaic embryos. a, ROSA26-targeted 
iMOS!!™/12-P35 construct and the three mosaics studied. b-d, Overlay views 
of EYFP and ECFP signals in E9.5 mosaics. Scale bars, 1 mm. e-g, Confocal 
sections showing head mesenchyme corresponding to the boxed areas in 
b-d. Scale bars, 50 um. h, Flow cytometry analysis of EYFP and ECFP cells in 
disaggregated representative E9.5 mosaics. Arrowheads indicate ECFP cell 
counts. i, ECFP proportion in confocal sections of E9.5 mosaics (n = 7, 8, 8 
(from left to right) embryos). j-o, TUNEL staining in whole E6.5 mosaics. 
j-l, Whole-embryo confocal maximal projections. PS, pre-streak. m-o, Details 
of confocal sections. TUNEL-positive EYFP and ECFP cells are indicated by 
filled and open arrowheads, respectively. Scale bars, 100 um (j-1); 25 um 
(m-o). p, TUNEL-staining frequency in E6.5 mosaics. q, TUNEL-staining 
frequency in EYFP and ECFP populations in the E6.5 mosaics (n = 4, 4, 4 
embryos) Embryos in i, p and q were collected from various similar litters and 
represent biological replicates. Graphs show means + s.e.m. in i and s.d. in 

p, q; **P < 0.01; ***P < 0.001 by unpaired two sample t-test. 
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Figure 3 | Myc-induced cell competition is triggered by short-range 
interactions. a—c, Confocal sections showing TUNEL staining and EYFP in ES 
cell colonies. Scale bars, 50 jm. d, TUNEL-staining percentage in wild-type 
(EYFP-negative) cells (n = 1,331, 773, 643, 858 total cells from 5 different 
cultures). e, Confocal three-dimensional reconstruction ofa representative E6.5 
iMos!! Myc epiblast, showing EYFP, active caspase-9 and active Bax. Scale bar, 
50 um. f, Percentages of apoptotic wild-type cells in e (m = 2,112 total cells). 
Graphs show means and 95% confidence interval; **P < 0.01; ***P < 0.001 by 
binomial test. 


Information and Supplementary Fig. 19). We next measured Myc levels 
in non-apoptotic cells and in cells in the first phases of apoptosis (cells 
with active caspase-3 staining but still-intact nuclei) (Fig. 4i). Myc levels 
in non-apoptotic cells were on average >2 times higher than those in 
early apoptotic cells (Fig. 4j). This result is not due to Myc degradation 
during apoptosis (Supplementary Information and Supplementary 
Fig. 20), and thus suggests that intrinsically low levels of Myc protein 
correlate with natural apoptosis in the epiblast. Nonetheless, the detection 
of some apoptotic cells in the EYFP cell population of the iMos'’™"; 
Myc ‘~ embryos (Supplementary Fig. 20) indicates that not all epiblast 
cell death correlates with heterogeneous Myc levels. 

Endogenous Myc levels were also variable in wild-type ES cells 
(Fig. 5a, b), and pairs of engulfing-engulfed cells were abundantly 
detected in wild-type ES cell cultures (Supplementary Fig. 21). 
Analysis of TUNEL-negative engulfing—engulfed cell pairs in wild- 
type ES cell cultures revealed that Myc levels in engulfing cells were 
double those in their engulfed partners (Fig. 5c-e), again indicating a 
strong correlation between Myc levels and natural cell death. 

Given that anabolic capacity is a major determinant of cell com- 
petition ability, we determined the relationship between endogenous 
Myc levels and anabolic status by measuring the protein synthesis 
rate in ES cell cultures in situ (Fig. 5f-h). Protein synthesis rate cor- 
related positively with the natural variations in Myc levels (Fig. 5i). 
Furthermore, protein synthesis rate was enhanced by the extra Myc 
copies in iMOS"’™”” ES cells (Fig. 5j), indicating that Myc is a ‘driver’ 
and not a ‘reader’ of cell anabolism. Scoring of protein synthesis rate 
in engulfing—engulfed pairs of cells showed higher rates in engulfing 
cells (Supplementary Fig. 22). 

Our findings strongly suggest that endogenous cell competition elim- 
inates cells with low relative Myc levels in wild-type epiblasts and in ES 
cell cultures. At least part of the natural epiblast cell death would thus be 
a consequence of endogenous cell competition. Supporting this view, 
wild-type embryos express the same apoptosis markers detected in 
iMOS'* mosaic embryos and also show phagocytosis between epi- 
blast cells (Supplementary Fig. 21). We therefore used the iMOS'*?* 
mosaics to study the consequences of inhibiting natural cell death in the 
mouse epiblast. TUNEL staining in the p35-expressing ECFP popu- 
lation was lower than in the wild-type EYFP population (Fig. 6a, b, e). 
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Figure 4 | Epiblast cells with low Myc expression preferentially undergo 
natural apoptosis. a, b, d, e, Confocal sections showing Myc expression. Scale 
bars, 50 jum. ¢, f, Colour coding of Myc levels ina (total cells) and d (EYFP cells); 
green indicates the lower and blue the higher Myc levels in arbitrary 
fluorescence units. g, Detail of Myc expression in the wild-type epiblast. Scale 
bar, 50 um. h, Distribution of Myc levels (arbitrary fluorescence units; AFU) in 
total cells of E6.5 wild-type embryos (n = 5 embryos, 543 cells) and EYFP cells 
of E6.5 iMos!!™, Myc ’~ mosaics (n = 3 embryos, 267 cells). i, Detail of 
Myc and active caspase-3 expression in the wild-type epiblast. Scale bar, 50 [um. 
j, Distribution of Myc levels in non-apoptotic and apoptotic wild-type epiblast 
cells (n = 8 embryos, 2,376 cells). Embryos in h and j were collected from 
various genetically equivalent litters and represent biological replicates. Dashed 
lines in h and j indicate averages. ***P < 0.001 by a mixed linear model with 
different variances per level. 


This reduction led to an enrichment in ECFP cell proportion at E9.5 
(Fig. 6c, d, f), indicating that naturally dying cells in the epiblast can be 
rescued and contribute to the developing embryo. Moreover, iMOS'*?*? 
mosaics reared to adulthood showed no obvious phenotypic alteration 
(n > 40). Thus, consistent with the idea of cell competition as a driver of 
epiblast apoptosis, naturally dying epiblast cells are viable and can be 
rescued to form part of the organism. Interestingly, whereas low Myc 
levels again correlated with apoptosis in the EYFP cell population 
of iMOS'??* mosaics, within the p35-expressing cell population 
Myc levels did not differ between the apoptotic and non-apoptotic cells 
(Fig. 6g). This result again shows that not all epiblast cell death results 
from Myc level heterogeneity, and that, in this setting, p35 can prevent 
the competition-driven death of viable cells but cannot rescue other 
types of epiblast cell death, presumably affecting non-viable cells. 
Our findings predict that endogenous cell competition should select 
for cells with higher Myc levels and thus, that inhibition of cell com- 
petition should prevent this selection. Confirming this, mean Myc 
levels in the apoptosis-rescued p35-expressing ECFP population of 
iMOS'???° mosaics were 10% lower at E5.75 and 24% lower at E6.75 
than in the wild-type EYFP population (Fig. 6h-l). Thus, prevention of 
natural epiblast cell death results in the progressive accumulation of cells 
with lower relative Myc content, which would have been eliminated in a 
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wild-type context. Conversely, when iMOS"’™” mosaics are induced 


ona Myc*’~ background, Myc levels in the outcompeted cell population 
are significantly higher than those observed in Myc ’/~ embryos without 
induced cell competition, as predicted by the enhanced elimination of 
Myc-low cells in the mosaic embryos (Fig. 6m). These results show that 
endogenous cell competition selects for cells with higher Myc levels in 
the epiblast. 


Discussion 

Our results show that cell competition is promoted by an imbalance in 
Myc dose between neighbouring cells in the mouse epiblast and ES 
cells. Crucially, it is not the absolute levels of Myc but rather the 
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Figure 5 | Cell death in ES cells correlates with low Myc levels and anabolic 
activity. a, b, Confocal sections of uninduced (WT) imMos™*7imos't 
ES cells, showing endogenous Myc expression and TUNEL staining. Scale bar, 
50 um. ¢, Detail of cell engulfment. Scale bar, 10 jim. d, e, Myc level paired ratios 
(d) and global distributions (e) in engulfing and engulfed TUNEL-negative 
wild-type ES cells (n = 39 pairs) and cells not involved in phagocytosis (‘other’; 
n= 1,193 cells) in a representative culture. f-h, Confocal sections of wild-type 
ES cells showing Myc expression and protein synthesis labelling with 
L-azidohomoalanine (AHA). Scale bar, 50 um. i, Correlation between 
endogenous Myc and protein synthesis in wild-type ES cells (m = 1,208 cells) in 
a representative culture. j, Increase in protein synthesis in induced iMOS™! "7 
iMOS''™ mosaic ES cells expressing extra Myc copies (EYFP) (n = 568 (left) 
and 589 (right) cells from a representative culture). Graph shows means and 
95% confidence intervals. Horizontal bars in d indicate mean ratios. Dashed 
lines in e indicate mean value. ***P < 0.001 by one sample t-test P value 

(d), Gaussian linear model (e) and linear model (j). 
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relative levels among neighbouring cells that determine cell viability, 
and thus the natural cell competition detected in the mouse epiblast 
leads to the elimination of cells with lower Myc expression. These 
findings uncover a new mechanism for refining the epiblast cell popu- 
lation. We propose a model in which epiblast cells are subject to 
comparison with their neighbours, resulting in the elimination of cells 
with lower anabolic capacity, even though in the absence of more 
competitive cells they would contribute to the new organism (Sup- 
plementary Fig. 1). Such a mechanism, by ensuring that the most 
capable of the available precursor cells are selected to generate the 
new organism, would be especially relevant to long-lived organisms, 
in which somatic tissues need to maintain functionality for long 
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Figure 6 | Endogenous cell competition in the mouse epiblast selects cells 
with higher relative Myc levels. a, b, Confocal sections of TUNEL-stained E6.5 
mosaics. Scale bars, 50 um. ES, early streak. c, d, Overlay views of EYFP and 
ECFP in whole-mount E9.5 mosaics. Scale bars, 1 mm. e, f, Relative TUNEL- 
staining frequencies (n = 9 embryos, 10,116 cells) (e) and relative ECFP cell 
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Graphs show means = s.d. g, Distribution of Myc levels in cell populations of 
E6.5 mosaics (n = 3 embryos, 2,112 cells). h-k, Confocal section showing EYFP 
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Scale bar, 50 um. |, Distribution of Myc levels in representative mosaics at E5.75 
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m, Distribution of Myc levels in the indicated E6.5 embryo genotypes and cell 
populations (n = 494 (grey), 354 (green) and 170 (blue) cells from 
representative embryos). Embryos in e, f, g and 1 were collected from various 
similar litters and represent biological replicates. *P < 0.05; **P < 0.01; 

**P < (),001 by one-sample t-test (e and f), mixed linear model (g and I) and 
generalized gamma lineal model (m). 
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periods. Under this interpretation, cell competition in the epiblast, 
which contains the precursors for the whole organism, would have the 
most far-reaching impact. These ideas are consistent with the finding 
that only epiblast cells have a high incidence of cell death, whereas 
extraembryonic tissues, which are short-lived, do not. 

The description of endogenous cell competition in mammals opens 
the door to investigation of the role of this pathway in human diseases 
and processes in which tissue homeostasis is central, such as tumour 
formation, degenerative diseases and tissue regeneration. Our find- 
ings indicate that cell competition is a general feature of metazoans, 
thus suggesting that it represents a fundamental property of cell inter- 
action within tissues. 


METHODS SUMMARY 


DNA constructs, mouse strains, ES cell derivation and assays, three-dimensional 
embryo reconstruction, quantification of Myc protein and protein synthesis, and 
statistical analysis are detailed in the Methods. Confocal microscopy, flow cyto- 
metry, measurement of embryo total DNA content, immunofluorescence, 
TUNEL assay, BrdU labelling and in situ hybridization were conducted by stand- 
ard procedures described in the Methods. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


DNA constructs. iMOS DNA constructs were assembled by standard cloning 
methods, starting from pBigT vector’'. loxP sites were transformed into lox2272 
sites’? using the QuickChange site-directed mutagenesis kit (Stratagene). lox5171 
sites”’ were inserted as annealed oligonucleotides. IRES-EYFP and IRES-ECFP were 
excised from pEYFP-N1 and pECFP-N1 vectors (Clontech), and were respectively 
inserted into the Tl and T2 units of iMOS constructs (Supplementary Fig. 2a). A 
triple SV40 polyadenylation sequence (tPA) was excised from pBigT and inserted 3’ 
of EYFP (Supplementary Fig. 2a). 

To ensure that T1 and T2 recombination frequencies were the same in all the 
iMOS mouse lines, a constant distance was maintained between lox site pairs in all 
constructs. In the iMOS“’” construct (Supplementary Fig. 2a), the buffer sequ- 
ences corresponded to non-translatable mutants of the ORFs for mouse Myc (T1) 
and baculovirus p35 (T2). The mutated Myc ORF (Myc? ) was generated 
by PCR-mediated insertion of a T at the beginning of the fourth codon 
(igAAC > 10TAA12). The mutated p35 ORF (p35"’?) was obtained by PCR- 
mediated T>A_ substitution of nucleotide six (4TGTs>4TGAg). In the 
iMos''™ construct, the wild-type Myc ORF was inserted 5’ to IRES-EYFP, 
in place of the T1 buffer sequence, and the p35“? ORF was used as the T2 buffer 
sequence (Supplementary Fig. 4a). In the iMOS'?** construct, the wild-type p35 
ORE was inserted 5’ to IRES-ECFP, in place of the T2 buffer sequence, and the 
Myc*’? ORE was used as the T1 buffer sequence (Supplementary Fig. 8a). In the 
iMOS™™/1?-P35 construct, the wild-type Myc ORF was inserted 5’ to IRES- 
EYFP and the wild-type p35 ORF was inserted 5’ to IRES-ECEP, replacing the 
T1 and T2 buffer sequences (Fig. 2a). The KpnI site in the pROSA26PA vector” 
was substituted by a Swal site. pBigT-inserted iMOS constructs were cloned into 
Pacl/Ascl sites of this modified pROSA26PA vector for gene targeting in ES cells. 
Mouse strains. To generate iMOS mice, the iMOS targeting vectors were linear- 
ized at the Swal site and electroporated into ES cells”*. ES cell clones were screened 
by Southern blot**. Chimaeric mice were generated by ES cell aggregation. Mice 
were genotyped by PCR”. 

For mosaic induction, homozygous iMOS females (mixed C57BL/6 and R1 
ESC background) were crossed with heterozygous Sox2-cre males (stock 004783, 
Jackson Laboratory)’’. The stability of the iMOS system was tested by crossing 
homozygous iM OS"? males with Sox2-cre females (Supplementary Fig. 3a). 
Males derived from these crosses were used to establish a germ-line-recombined 
T1-expressing mouse line (iMOS“*") (Supplementary Fig. 3a). Females from 
this line were crossed with Sox2-cre males (Supplementary Fig. 3a). 

To characterize the functionality of Myc expressed from the iMOS"!” allele, 
Sox2-cre males were crossed with Myc” s/f females®> to obtain Sox2-cre mice 
with one Myc allele deleted (Sox2-cre; Myc*’~) (Supplementary Fig. 4b). Mouse 
lines homozygous for Mye!”* and iMOS™* or iMOS'*™” were generated, and 
females from these lines were crossed with Sox2-cre; Myc*’~ males (Supplemen- 
tary Fig. 4b). Once they are Cre-deleted, the Myc"* alleles are represented as 
Myc . 

To test the effect of Myc expression from the iMOS"” allele under different Myc 
genetic backgrounds, we crossed homozygous iMOS"" and iMOS™” females with 
Sox2-cre males to obtain imos“?, Sox2-cre and iMos!?™” °; Sox2-cre mosaic 
embryos in a wild-type Myc background. In parallel, homozygous iMOs™™”; 
Myc"** females were crossed with Sox2-cre; Myc*!~ males to obtain iMOST!™"; 
Sox2-cre mosaic embryos in the Myc*’~ and Myc ‘~ backgrounds. Embryos homo- 
geneously overexpressing Myc (iMOS%*®™'™") were generated by crossing 
iMOS™'®; Sox2-cre females with CD1 males. To generate Myc*!~ and Myc !~ 
embryos, Sox2-cre; Myc*!~ mice were crossed. 

iMos'™™”}iMOS'™; Myc*"* crosses with Sox2-cre; Myc*'~ males were 
used for the validation of Myc quantification (Supplementary Fig. 18) and for 
experiments in Fig. 6m. 

Tamoxifen-inducible mosaics were generated by crossing homozygous iMOS 
mice with homozygous RERT mice, which express the inducible Cre-ERT? 
recombinase under the control of the large subunit of RNA polymerase II locus”®. 

Early embryos were staged as PS (pre-streak), ES (early streak), MS (mid- 
streak) and LS (late streak), according to ref. 27. 

No randomization was used in any experiment and experiments were not 
blinded. No experiments were performed in adult animals. 

All animal procedures have been reviewed and approved by the CNIC Animal 
Experimentation Ethics Committee, according to the National and European 
regulations. 

Confocal microscopy. Whole embryos or histological sections and ES cell cultures 
were imaged with a Leica TCS SP5 confocal microscope at 405, 458, 488, 568 or 
633 nm, with X20/0.7 multi-immersion, *40/1.25 oil, X63/1.4 oil and X63/1.2 
water objectives. Areas occupied by EYFP and ECFP cells were quantified using the 
threshold detection and particle analysis tools in ImageJ (NIH, http://rsb.info. 
nih.gov/ij). To calculate the relative frequency of ECFP cells, the percentage of 
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ECFP cells in each embryo was divided by the average percentage in iMOS™’; 
Sox2-cre embryos. 

Flow cytometry. To quantify EYFP- and ECFP-expressing cell populations, cells 
from E9.5 Sox2-cre-recombined iMOS™', iMOS™!™”” and iMOst? 9/1? P35 
embryos were suspended in PBS containing 1% fetal calf serum (FCS), 25 mM 
glucose and 25mM HEPES by passing through 100,1m cell strainers (BD 
Biosciences). Cells were stained with Hoechst 33258 (Sigma-Aldrich) to exclude 
dead cells. Because erythroid cells lose iMOS transgene expression as they differ- 
entiate (data not shown), they were excluded from the quantification by incubating 
cell suspensions with APC rat anti-mouse TER-119 antibody (BD Biosciences), 
which specifically reacts with cells of the erythroid lineage. Cell suspensions were 
analysed by flow cytometry in a BD FACSAria II SORP at 449, 488 and 633 nm. 
EYFP- and ECFP-expressing cells were quantified in the live, TER-119-negative 
population. Embryos were analysed separately. 

To quantify cell size, cells from E9.5 iMOS WT. Sox2-cre embryos and iMos!!™"; 
Sox2-cre embryos in wild-type, Myc*’~ and Myc '~ backgrounds were suspended, 
stained and analysed as described. Forward scatter pulse area (FSC-A) of EYFP and 
wild-type cells was calculated for each mosaic and for wild-type littermates, and the 
mean ratio was calculated. 

Embryo total DNA content. E9.5 Sox2-cre-recombined iMOS™””* embryos 
and wild-type littermates were collected and individually stored at —80 °C. 
Total DNA was isolated and purified following standard protocols and measured 
in Nanodrop for each embryo. 

Immunofluorescence and TUNEL assay. Embryos were obtained at different 
gestational stages, fixed overnight at 4 °C in 2% paraformaldehyde (PFA) in PBS, 
and whole-mount stained or, when E= 10.5, gelatin-embedded and cryosec- 
tioned. Immunofluorescence was performed following standard protocols. 
Primary antibodies used were anti-phospho-histone H3 (Ser 10) ‘mitosis marker’ 
and biotin-conjugated anti-phospho-histone H3 (Ser 10) (both from Millipore); anti- 
BrdU antibody (Becton Dickinson); anti-Myc polyclonal antibody (Millipore); anti- 
p35 rabbit polyclonal antibody (Orbigen); anti-E-cadherin antibody (DECMA-1) 
(Abcam); anti-cleaved caspase-3 (Asp 175) (SA1E) rabbit antibody and Alexa 
Fluor 488-conjugated anti-cleaved caspase-3 (Asp 175) antibody (both from Cell 
Signaling); anti-Bax (ab10813) and anti-Bax (6A7) (ab5714) antibodies (both from 
Abcam); anti-cleaved caspase-9 (Asp 353) antibody (Cell Signaling); anti-Bcl-xL 
(54H6) rabbit antibody (Cell Signaling); and anti-cleaved caspase-8 (Asp387) 
(D5B2) rabbit antibody (Cell Signaling). Mitotic cells and cells in the G2/M transition 
were detected by pH3 staining. pH3-positive nuclei exhibiting a discrete punctate 
pattern and showing an interphasic morphology were scored as G2/M cells. 

TUNEL staining was performed on whole-mount embryos or sections using 
terminal deoxynucleotidyl transferase (TdT) and biotin-16-2’-deoxy-uridine-5’- 
triphosphate (biotin-16-dUTP) (both from Roche), and developed with several 
streptavidin fluorescent conjugates (Jackson ImmunoResearch). 

BrdU labelling. To quantify cell proliferation of EYFP Myc-overexpressing and 
ECFP wild-type cells in iMOS''™; Sox2-cre mosaic embryos, pregnant females 
were intraperitoneally injected with 75 mgkg_' of BrdU (Sigma-Aldrich) at E6.5, 
E7.5 or E9.5, and embryos were extracted 1, 1h 30 min or 3h later, respectively. 
Embryos were fixed, treated with DNase I to cleave DNA and allow the binding of 
anti-BrdU antibody to BrdU, and immunostained as described above. 

ES cell derivation and assays. ES cells were derived from E3.5 blastocysts homo- 
zygous for iMOS™™ or iMOST' 1? P*° and RERT and cultured on mitomy- 
cin-C-inactivated mouse embryonic fibroblast (MEF) feeder layers in serum 
replacement (SR)-ES medium”. Cell lines were checked negative for mycoplasma 
and for normal karyotype at their establishment. For in vitro assays, MEF- 
depleted ES cells were seeded on fibronectin-coated 3.5-mm glass bottom dishes 
(MatTek) and cultured in 2X LIF SR-ES medium. Cells were treated with 20 uM 
4-hydroxytamoxifen (4-OHTAM) (Sigma-Aldrich) for 24h to induce a high 
recombination rate, or 5M 4-OHTAM for 4h for a low recombination rate. 
Cells were fixed overnight at 4 °C in 2% PFA in PBS at the time points detailed in 
each experiment. Immunofluorescence and TUNEL staining were performed as 
described. 

For conditioned medium experiments, homozygous iMos?!™ °;RERTES cells 
were induced as described in Supplementary Fig. 11b. Conditioned medium from 
these cultures was collected at t,. For protein synthesis assays, we used the Click- 
iT detection of protein synthesis assay kit (Life Technologies). Metabolic labelling 
and detection was performed according to the manufacturer on untreated and 
4-OHTAM-treated (10 1M for 24h at t,; Supplementary Fig. 11b) homozygous 
iMOS!!™Y";RERT ES cells using Click-iT AHA (L-Azidohomoalanine), Click-iT 
cell reaction buffer kit and Alexa Fluor 647 alkyne triethylammonium salt. Myc 
immunofluorescence was performed after the Click-iT detection reaction. Myc 
protein and metabolic labelling were quantified on confocal sections using the 
Metamorph platform as described below. 
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Three-dimensional embryo reconstruction. Three-dimensional embryo recon- 
struction and analysis of apoptosis frequency were performed in iMOS!Sox2- 
cre E6.5 embryos using Imaris X64 software. Visceral endoderm was digitally 
removed to allow better visualization of the epiblast. 

In situ hybridization. Whole-mount in situ hybridization was performed fol- 
lowing standard protocols. The Myc probe was provided by T. Rodriguez and the 
Oct4 probe by M. Manzanares. 

Quantification of Myc protein and protein synthesis. Quantitative image ana- 
lysis of confocal sections was performed with the Metamorph platform (Mole- 
cular Devices). To standardize quantification, we used medial confocal sections of 
E6.5 embryos, in which epiblast cells are mostly sectioned longitudinally and 
located in the embryo periphery. Several sections were quantified for each 
embryo. The nuclear area was defined from the DAPI image and Myc protein 
levels from the Cy3-developed Myc immunofluorescence image. Only nuclei 
showing an interphase morphology were scored, as Myc is delocalized during 
mitosis. Early apoptotic cells were defined by their cleaved caspase-3 or incipient 
TUNEL staining and the retention of normal cell and nucleus size. The DAPI 
images were signal-intensity thresholded, segmented and converted to 1-bit 
binary images using ImageJ. The DAPI mask was applied to the matching Myc 
image to obtain a combined image containing Myc fluorescence information for 
each nucleus. Myc fluorescence was measured as the ‘average grey value’ (the total 
grey value per nucleus) and expressed in arbitrary fluorescence units (AFU). 
Values were classified in regular linear intervals represented by a colour code. 
Myc fluorescence values were exported to Excel and frequency histograms, as well 
as curve interpolations, were generated. 

To validate this quantification system, E6.5 embryos containing different Myc 

doses were analysed under the same confocal acquisition settings, and raw inten- 
sity data were quantified without post-processing (Supplementary Fig. 18). A 
similar procedure was used in all Myc quantifications in embryos and Myc and 
protein synthesis quantifications in ES cells. 
Statistical analysis. The use of genetic mosaics, which allowed the comparison of 
randomly distributed cell populations within each embryo or ES cell colony, 
minimized usual sources of experimental variability between biological replicates. 
In experiments involving embryos we used a minimum of 4 embryos and scored a 
minimum of 1,000 cells per embryo, which allowed a minimum of 4 individual 
embryos for detection of at least 40% differences between population means at a 
power of 80% and a 0.05 alpha-level. 

To compare average percentages of ECFP cells between more than two groups, 
we used the ANOVA test for normal distributions or the Kruskal-Wallis test for 
non-normal distributions. If there was a significant difference between the group 
means, the Tukey HSD test was used to test differences among each pair of means. 
For comparisons of only two groups we used a f-test for normal distributions or a 
Mann-Whitney U-test for non-normal distributions. Population mean differ- 
ences from unity were tested with a one-sample t-test (normal distribution) or a 
Wilcoxon test (non-normal). All comparisons were made with Prism statistical 
software. 

The significance of population changes in ES cell cultures over time and 
increases in Myc protein and total protein synthesis were compared by proportion 


test. We also used these tests to determine percentages of apoptotic and prolife- 
rating cells in different ES cell populations. 95% confidence intervals from a bino- 
mial distribution are shown in all figures. 

To compare the distribution of Myc and protein synthesis levels across differ- 
ent genotypes or cell populations, we used generalized mixed linear models. In 
Fig. 4 we used a mixed linear model in which we considered the condition (WT/ 
iMos'™'"; Myc ‘~, Fig. 4h; apoptotic/non-apoptotic, Fig. 4j) as fixed and the 
embryo as random factor. We used the same model to test Myc level differences 
between pH3-positive and -negative cells (Supplementary Fig. 19) and between 
apoptotic and non-apoptotic cells detected by TUNEL staining (Supplementary 
Fig. 20). In all cases, a variance function was used to model the variance structure 
of the errors within the experimental groups”. To compare Myc and protein 
synthesis levels in the cell populations shown in Fig. 5e and Supplementary 
Fig. 22e, we used a linear model with this characteristic as a factor. Addi- 
tionally, in Fig. 5i we used Deming regression” to describe the correlation 
between Myc and protein synthesis. In Fig. 61 we analysed whether there was 
an interaction between differences in Myc levels in EYFP and ECFP populations 
and developmental stage. The fixed part of the model comprises the population 
effect, the developmental stage and the developmental stage interaction; the ran- 
dom part of the model is the embryo effect. A mixed model was also used to test 
differences between the distribution of Myc levels in apoptotic and non-apoptotic 
cells in ECFP and EYFP populations (Fig. 6g). In Fig. 6m and Supplementary Fig. 
18, we tested the distribution of Myc levels between different genotypes using a 
general linear model of the gamma family. Standard diagnostics checks on the 
residuals from the fitted models were used to test assumptions of normality and 
heterogeneity of variance. All statistical analyses were performed using the open- 
source statistical scripting language R version 2.11.0 (R Development Core Team, 
2006). Mixed linear model equations were fitted using the nlme R library*’. The 
total number of embryos and/or cells scored in each analysis is provided in the 
figure legends. 
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Integrative genomics identifies APOE ¢4 
effectors in Alzheimer’s disease 


Herve Rhinn'**, Ryousuke Fujita’**, Liang Qiang”, Rong Cheng’, Joseph H. Lee*? & Asa Abeliovich'? 


Late-onset Alzheimer’s disease (LOAD) risk is strongly influenced by genetic factors such as the presence of the apolipo- 
protein E <4 allele (referred to here as APOE4), as well as non-genetic determinants including ageing. To pursue mecha- 
nisms by which these affect human brain physiology and modify LOAD risk, we initially analysed whole-transcriptome 
cerebral cortex gene expression data in unaffected APOE4 carriers and LOAD patients. APOE4 carrier status was 
associated with a consistent transcriptomic shift that broadly resembled the LOAD profile. Differential co-expression 
correlation network analysis of the APOE4 and LOAD transcriptomic changes identified a set of candidate core regu- 
latory mediators. Several of these—including APBA2, FYN, RNF219 and SV2A—encode known or novel modulators of 
LOAD associated amyloid beta A4 precursor protein (APP) endocytosis and metabolism. Furthermore, a genetic variant 
within RNF219 was found to affect amyloid deposition in human brain and LOAD age-of-onset. These data implicate an 


APOE4 associated molecular pathway that promotes LOAD. 


Alzheimer’s disease is the most common form of dementia worldwide, 
and is characterized by cognitive decline with distinctive brain patho- 
logy that includes regional neuron loss, amyloid plaques and neurofi- 
brillary tangles’. Rare familial forms of Alzheimer’s disease are early in 
onset and caused by mutations in specific genes’. In contrast, the aeti- 
ology of common, non-familial LOAD appears much more complex 
and includes many genetic and environmental factors. The major consti- 
tuent of amyloid plaques is amyloid beta (AB), a proteolytic fragment 
derived from APP. The observation that familial-associated mutations 
typically modify AB production has led to formulation of the so-called 
‘amyloid hypothesis’, which posits that altered APP processing results 
in the generation of a toxic fragment that causes neurodegeneration. 

Recent genome-wide association studies (GWAS) have implicated 
common variants at approximately 10 genetic loci in LOAD*”. Indivi- 
dually, these variants have a modest effect on LOAD risk, with the 
notable exception of the APOE ¢4 allele (APOE4): heterozygosity of 
APOE4 is associated with a greater than threefold increase in LOAD 
risk, when compared to the common APOE £3 allele (referred to here 
as APOE3), and APOE4 homozygosity increases risk more than ten- 
fold. A third and more rare allele, APOE €2 (referred to here as APOE2), 
appears protective relative to APOE3 in that carriers of this allele are at 
reduced risk of LOAD relative to non-carriers. The APOE4 allele has 
been shown to modify the tertiary structure of APOE protein, but how 
this ultimately impacts LOAD is unclear. Various pathogenic mecha- 
nisms have been proposed for APOE4, including defects in the clear- 
ance and degradation of extracellular Af, in lipid and cholesterol 
trafficking, in APP metabolism, in inflammation and in other aspects 
of neuronal function and survival*”’. 

To pursue molecular pathways that underlie LOAD in an unbiased 
fashion, whole-transcriptome differential gene expression analyses have 
been used'*'”. However, gene expression changes in diseased brain tissue 
often reflect processes that are secondary to the disease pathology, 
such as cell loss, rather than causative events. Network approaches, 
including differential co-expression analysis (DCA)'**', have been deve- 
loped to disentangle causative events from secondary changes within 


transcriptome-wide gene expression data sets'**'. DCA is based on the 
notion that transcripts encoding causal ‘nodes’ or ‘master regulators’ 
of a disease process—whose activities are critically altered in a patho- 
logical state such as LOAD—can be identified by their co-expression 
correlation network properties, but not by simple differential gene 
expression analysis. We have broadened existing DCA tools to include 
consideration of genes other than those encoding known transcrip- 
tion factors*', with the rationale that DCA nodes would likely include 
upstream regulators of such factors. DCA ranks most highly those trans- 
cripts that are most altered in their co-expression correlation with the 
greatest number of differentially expressed transcripts in the context 
of a pathological state such as LOAD (Supplementary Fig. 1). 

Here we apply DCA to uncover regulatory mechanisms that affect 
LOAD and LOAD risk. We first show by differential gene expression 
analysis that a transcriptome-wide pattern of change is associated with 
the presence of APOEF4 in disease-free brain tissue, and that this pattern 
overlaps with the transcriptomic changes that distinguish LOAD affec- 
ted from unaffected brain tissue. Subsequent DCA identified candidate 
regulatory node genes predicted to mediate the common transcripto- 
mic changes observed in APOE4 carriers and LOAD patients. These 
genes include several known or novel modifiers of APP processing and 
endocytic trafficking such as APBA2, ITM2B, FYN, RNF219 and SV2A, 
suggesting a shared mechanism of action. Genetic or pharmacological 
modulation of these candidate regulatory nodes suppressed altered APP 
processing in cell models, including APOE4-positive human induced 
neurons (hiNs). Furthermore, meta-analysis of LOAD genome-wide 
association data indicated that common genetic variants within two 
such candidate regulatory nodes, FYN and RNF219, are predictive of 
LOAD age-of-onset in an APOE4-dependent manner. Taken together, 
our findings reveal an APOE4-dependent molecular pathway to LOAD 
and LOAD risk. 


A pre-LOAD state in APOF4 carriers 


To broadly pursue a molecular correlate of the APOE4 prodromal state 
in an unbiased manner, we initially re-evaluated publically available 
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transcriptome-wide gene expression data from cerebral cortex autopsy 
brain tissue of 185 individuals unaffected by LOAD, stratified by their 
APOE genotype (Supplementary Fig. 2, Gene expression omnibus 
(GEO) accession number GSE15222 (ref. 22)). Hierarchical clustering 
analysis revealed that the overall pattern of gene expression change 
associated with all APOE high-risk genotypes was closely related to the 
pattern of change associated with LOAD (relative to other neurologi- 
cal diseases; Fig. 1a). 

We next sought to characterize in detail the specific gene expression 
changes affected by APOEF4 carrier status and to contrast these with 
changes seen in LOAD brain. To avoid potential confounding effects, 
LOAD associated changes in brain gene expression were quantified 
independently of APOE variability (by limiting the analysis of LOAD 
impact to individuals homozygous for APOE3); conversely, the tran- 
scriptomic effects of APOE4 carrier status were evaluated independen- 
tly of the LOAD analysis (by limiting the study to LOAD-unaffected 
individuals and excluding APOE3 homozygous samples that were used 
in the LOAD analysis, Supplementary Fig. 3). Gene expression changes 
were highly correlated in the two comparisons (R = 0.62,P=4 10 '° 
by Wald statistics): out of 8,449 gene transcripts detected in all sam- 
ples, 215 were found to be significantly altered in expression by either 
APOE4 or LOAD status in the same direction, whereas 37 were signifi- 
cantly altered in opposite directions (Fig. 1b, c, Supplementary Fig. 4a 
and Supplementary Table 1). We note that similar analysis of the tran- 
scriptomic effect of ageing in LOAD-unaffected individuals (age >85, 
relative to age <75)—the major non-genetic risk factor for LOAD— 
revealed a high degree of overlap with LOAD-associated transcripto- 
mic changes, but not with the changes associated with APOE4 carrier 
status (Fig. 1c and Supplementary Figs 4b, c and 5). These data suggest 
that separate mechanisms underlie the effectof APOE status and age- 
ing on LOAD risk. 

Next, we hypothesized that the overlapping pattern of brain gene 
expression changes associated either with LOAD or with unaffected 
APOE high-risk status in brain tissue (termed the APOE4/LOAD 
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Figure 1 | Transcriptomic evidence of a pre-LOAD state in unaffected 
APOF4 brain tissue. a, Schematic of the overlapping patterns of 
transcriptome-wide cerebral cortex gene expression changes associated with 
LOAD or with APOE4 carrier status. b, Hierarchical clustering dendrogram 
demonstrates similarity between the transcriptome-wide gene expression 
changes associated with APOE4 unaffected carrier status and with LOAD. Also 
presented are the differential expression profiles of LOAD laser-microdissected 
neurons (LOAD-LMD), bipolar disorder cerebral cortex (BD), schizophrenia 
cerebral cortex (SZ), fronto-temporal dementia cerebral cortex (FTD), 
Huntington’s disease (in frontal cortex (HD 1) or in caudate nucleus (HD 2)) 
and Parkinson’s disease (in substantia nigra (PD 1), in frontal cortex (PD 2), or 
in putamen (PD 3)). Analyses used Gene Expression Omnibus (GEO) data sets 
GSE15222, GSE5281, GSE12649, GSE13162, GSE3790 and GSE20295. c, Venn 
diagram illustrating the overlap in transcriptome-wide gene expression 
changes in the context of APOE4, advanced age (>85 years old) or LOAD. 
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pattern; defined by a set of 215 transcripts) represented a signature 
ofa pre-LOAD prodrome state (Fig. 1b, cand Supplementary Fig. 3b). 
Consistent with this, the APOE4/LOAD pattern was highly correlated 
with changes seen in transition from unaffected LOAD-free tissue to 
incipient-LOAD tissue in an independent data set of cortical tissue 
transcriptome expression profiles (R = 0.36, P=9 X 10 * by Wald 
statistics; rather than the transition from incipient to moderate, or 
from moderate to severe disease, R = 0.04 and 0.02, respectively; Sup- 
plementary Fig. 4d). 


APOE4 and LOAD differential co-expression analysis 


Transcriptome-wide differential gene expression analyses, as descri- 
bed above for APOE4 or LOAD, provide a broad and unbiased mole- 
cular perspective of a tissue. However, mechanistic interpretation of 
individual gene expression changes is challenging, as such changes 
may represent indirect, downstream consequences of pathology. 
To address this issue, we applied DCA—a tool for the identification 
of candidate regulatory node elements that play causal roles in the 
context of transcriptome-wide gene expression network changes—to 
the APOE4/LOAD pattern of change (Fig. 2a—c and Supplementary 
Figs 6-9). Among the 20 most highly ranked candidate node elements 
identified by DCA, we noted that several had previously been impli- 
cated in the processing and intracellular sorting of APP, includ- 
ing APBA2 (ref. 23), ITM2B (ref. 24), TMEMS59L (ref. 25) and FYN 
(ref. 26), suggesting a shared mechanism of action. Consistent with this 
notion, another top DCA hit, SV2A, encodes a well-described regulator 
of neuronal endocytosis. Furthermore, a selective SV2A inhibitor, the 
anti-epileptic levetiracetam”’, was recently reported to suppress patho- 
logical neuronal hyperactivity in the hippocampus of individuals with 
mild cognitive impairment (MCI), which precedes LOAD”. The top- 
ranked DCA hit, RNF219, had not previously been linked to LOAD, 
but common genetic variants at the RNF219 locus had been associated 
with alternations in lipid metabolism”, cognitive performance” and 
central nervous system ventricle volume”’, akin to pathological asso- 
ciations ascribed to APOE4. 

We sought to test whether DCA candidate node genes (Fig. 2b) 
may broadly function as regulators of APP processing in an APOE4- 
dependent manner. To functionally evaluate DCA-identified genes, 
we initially used a simple in vitro model system in which mouse neuro- 
blastoma N2a cells that overexpress a human APP transgene (N2a-APP 
cells) were treated with exogenous APOE protein variants (100 pg ml * 
for 18 h). In this APP-N2a model system, treatment with recombinant 
human APOE ¢4 variant protein (rhAPOE4), but not APOE «2 or 
APOE £3 protein (rhAPOE2 or rhAPOES3), significantly increased the 
levels of extracellular AB40 and AB42 (Supplementary Fig. 10a). Indi- 
vidual knockdown of several DCA hits by transfection of vectors 
encoding appropriate small hairpin RNAs (shRNAs; Supplementary 
Fig. 11a), including RNF219, SV2A, HDLBP, ROGDI, CALUand PTK2B, 
suppressed the induction of AB40 and AB42 levels in the context of 
rhAPOE4 treatment (Fig. 2d and Supplementary Figs 10b-d, 11b-h). 
With the exception of HDLBP, transfection of these knockdown vec- 
tors did not affect AB levels in the absence of rhAPOE4 (in rhAPOE2- 
or vehicle-only treated cultures). Thus, the DCA-identified node hits 
included many known or novel potential modifiers of APOE4-dependent 
AB accumulation. 


RNF219 and SV2A modulate APP processing 


We next investigated mechanisms by which DCA-identified genes may 
mediate the impact of APOE4 on APP processing, and focused on the 
top two DCA hits: RNF219 and SV2A. Cleavage of APP by the BACE1 
B-secretase, a rate-limiting step in AB production”, generates both a 
soluble extracellular fragment, sAPP, and a transmembrane carboxy- 
terminal fragment (B-CTF), termed C99. rhAPOE4 treatment of N2a- 
APP cells increased levels of both sAPPP in cell media and C99 fragment 
in cell lysates (Fig. 2e, f and Supplementary Fig. 12). Such effects were 
suppressed by knockdown of either RNF219 or SV2A, pointing to 
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Figure 2 | Differential co-expression correlation analysis of the APOE4- 
and LOAD-associated transcriptomic states. a, Schematic of the DCA model. 
The pattern of node gene (red) co-expression with differentially expressed 
genes (yellow) is reconfigured in the context of APOE4 or LOAD tissue. 

b, Candidate node regulatory genes that underlie the APOE4/LOAD pattern of 
expression changes, rank-ordered by DCA. c, Heat maps representing the 
co-expression correlation coefficients for RNF219 or SV2A with representative 
genes from the APOE4/LOAD pattern set (in rows) as a function of APOE 
genotype and LOAD status (in columns; Und, LOAD-unaffected). High 
co-expression correlations are denoted in red, high anti-correlations in blue. 
d-f, rhAPOE4 treatment of N2a-APP cells induced the accumulation of APP 
metabolites including AB40, AB42 (d) and sAPP® (e) in extracellular media, 
and C99 fragment (f) in cell lysates. Such accumulations were suppressed with 
transfection of shRNA vector-mediated knockdown of RNF219 or SV2A 
(RNF219-KD or SV2A-KD, respectively) (relative to non silencing vector 
control (Non Sil)). Error bars are s.e.m.; n = 12, 8 and 8 (d, e), and 6, 5 and 6 
(f) independent wells per group for the Non Sil, RNF219-KD and SV2A-KD 
groups, respectively. *P < 0.05; analyses by ANOVA followed by Student- 
Newman-Keuls (SNK) test. 


increased BACE1 processing as a primary mechanism for the impact 
of APOE4 on AB accumulation, and a regulatory mediator role for SV2A 
and RNF219 in this context. Neither APP holoprotein nor BACE1 
levels were altered in the context of rhAPOE4 treatment or knock- 
down of these DCA hits. 

BACEI cleavage of APP is most efficient in acidic intracellular com- 
partments such as early and late endosomes”, and thus one potential 
mechanism for APOE4 action on APP processing would be through 
altered APP internalization’. We quantified the effect of DCA candi- 
date node knockdown on APP and BACE1 subcellular co-localization 
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in the context of rhAPOE4 stimulation, using either immunocyto- 
chemistry (ICC), or by cell-surface protein analysis through selective 
biotinylation and fractionation. rhAPOE4 treatment of N2a-APP cells 
shifted both APP and BACE1 localization away from the cell surface and 
toa common endocytic compartment, leading to increased co-localization 
(Fig. 3a—d). Such rhAPOE4-induced endocytic co-localization of APP 
and BACE] was suppressed by knockdown of the DCA node genes 
RNF219 or SV2A. Knockdown of RNF219 appeared to selectively inhi- 
bit the internalization of APP in the presence of rhAPOE4, whereas 
SV2A knockdown appeared to have a broader effect on APP interna- 
lization, even in the absence of rhAPOE4. Additionally, SV2A (but not 
RNF219) knockdown suppressed the internalization of BACE] in an 
APOE4-dependent manner (Fig. 3a—d and Supplementary Figs 13, 14). 

We further investigated the roles of SV2A and RNF219 in the con- 
text of overexpression studies. Transfection of N2a-APP cells with a 
vector encoding an SV2A enhanced green fluorescent protein fusion 
protein (SVA2-EGFP)** led to increased accumulation of sAPPB, AB40 
and Af42 species, regardless of the presence of rhAPOE4 (Fig. 3g and 
Supplementary Fig. 15d, e). Importantly, mutation of a critical tyro- 
sine residue within the endocytosis motif of SV2A, Y46A (ref. 33), that 
selectively impairs SV2A association with clathrin adaptor proteins 
but does not affect other activities of SV2A (ref. 33), abrogated the 
effect of SV2A-EGFP overexpression on APP processing. Furthermore, 
whereas wild-type SV2A-EGFP co-localized with APP and BACE1, 
such co-localization was reduced with the Y46A mutant of SV2A-EGFP 
(Supplementary Fig. 15). These data support a direct role for SV2A- 
mediated endocytosis in the regulation of APP processing. In contrast 
with SV2A-EGEP, overexpression of wild-type RNF219-EGFP fusion 
protein in N2a-APP cells did not appear to significantly alter APP pro- 
cessing. However, overexpression of modified RNF219-EGFP forms 
that harbour RING-domain missense mutations at highly-conserved 
cysteines RNF219(C18A/C21A)-EGFP, or a RING-domain deletion 
RNE219(A1-77)-EGFEP, suppressed the induction of AB40, AB42 and 
sAPPB levels selectively in the context of rhAPOE4 (Fig. 3g, h and Sup- 
plementary Fig. 15). Such RNF219 mutants may act in a dominant- 
negative fashion to affect APP endocytosis and processing. Taken 
together, these data further implicate DCA candidate nodes in mediating 
the effect of APOE4 on APP processing through modified endocytosis. 


Levetiracetam corrects APOE4 phenotypes 

We next hypothesized that treatment with levetiracetam, a selective 
SV2A inhibitor” used clinically to treat seizure disorders, could correct 
APOEF4 related alteration in APP processing. Consistent with this, leve- 
tiracetam significantly decreased extracellular AB42 and AB40 levels 
in rhAPOE4 exposed N2a-APP cells (Supplementary Fig. 16a-c). To 
extend our analyses to a more physiological context, we next used a set 
of hiN cultures™, derived from human skin fibroblasts of APOE4 car- 
riers or non-carriers. hiN cell cultures from APOE4 carriers displayed 
increased APP processing to AB42 and AB40 and increased APP co- 
localization with BACEI, relative to non-carriers (Fig. 4), consistent 
with the N2a studies above. Levetiracetam treatment suppressed the 
accumulation of AB40 or AB42 species, as well as APP co-localization 
with BACE1, selectively in the context of APOE4 carrier cultures. Thus, 
SV2A is required for the APOE4 mediated induction of APP proces- 
sing in the context ofhuman induced neurons that express endogenous 
levels of these proteins. 


RNF219 genetically interacts with APOE4 

The above studies indicated that DCA-identified candidate node genes 
may function as effectors of APOE4, leading ultimately to LOAD. 
Thus, a prediction is that common human genetic variants at DCA 
gene loci may impact the association of APOE4 with LOAD risk or age- 
of-onset. To this end, single-nucleotide polymorphisms (SNPs) within 
50 kilobases of the 20 top-ranking DCA identified genes were each 
evaluated for their genetic interaction with APOE4 in modulating 
LOAD age-of-onset, through a meta-analysis of 4 publicly available 
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Figure 3 | RNF219 and SV2A modulate APP proteolytic processing and 
localization in an APOE4-dependent manner. a-d, ICC analysis of APP and 
BACE1 co-localization in N2a cells in the context of APOE4 or vehicle, and 
with transfection of knockdown (KD) vectors for RNF219, SV2A or non 
silencing vector control (Non Sil); 1 = 25, 24,16,18, 30 and 36 cells in 3 
independent wells per group, for the groups Non Sil plus vehicle; Non Sil plus 
rhAPOE4; RNF219-KD plus vehicle; RNF219-KD plus rhAPOE4; SV2A-KD 
plus vehicle, and SV2A-KD plus rhAPOE4, respectively. Scale bar, 5 jim. 

e, f, Biochemical quantification of cell-surface APP upon RNF219 or SV2A 
knockdown in rhAPOE4-treated N2a-APP cells. a.u, arbitrary units. n = 6, 5 
and 6 independent wells for the Non Sil, SV2A-KD and rhRNF219-KD groups, 
respectively. g, h, AB40 levels in media of N2a-APP cells in the context of 
RNF219 (g) or SV2A-EGFP (h) overexpression. n = 24, 22, 12, 12, 18 and 18 
independent wells for the groups Non Sil plus vehicle, Non Sil plus rhAPOE4, 
RNEF219(wild-type) plus vehicle, RNF219(wild-type) plus rhAPOE4, 
RNF219(A1-77) plus vehicle, and RNF219(A1-77) plus rhAPOE4 
respectively; n = 24, 22, 12, 12, 18 and 18 independent wells for groups Non Sil 
plus vehicle, Non Sil plus rhAPOE4, SV2A-EGFP plus vehicle, SV2A-EGFP 
plus rhAPOE4, SV2A(Y46A)-EGFP plus vehicle, and SV2A(Y46A)—EGFP 
(g) respectively. For all figures, error bars are s.e.m., *P < 0.05 by ANOVA 
followed by Tukey HSD (a) or SNK test (c, d, f-h). 


GWAS data sets: GGnADA (875 LOAD cases and 850 controls**), TGEN 
Discovery (446 LOAD cases and 290 controls**), TGEN Replication (197 
LOAD cases and 114 controls**), and ADNI (180 LOAD cases and 
214 controls*’). SNP variants at the FYN and RNF219 loci were asso- 
ciated with significantly decreased LOAD age-of-onset in APOE4 
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Figure 4 | SV2A inhibition in human induced neurons modifies APP 
processing in an APOE4-dependent manner. a, AB40 and AB42 
quantification in media from APOE ¢€3/e3 (E3) or APOE €3/e4 (E4) hiN 
cultures treated with levetiracetam (Lev; 1 }1M) or vehicle-only. Results 
represent the means = s.e.m. (m = 6 and 12 independent wells for E3 and E4 
groups, respectively). *P << 0.05. Analysis by ANOVA followed by post-hoc 
Tukey HSD. b, c, ICC analysis of APP and BACE1 co-localization in APOE3 
and APOE4 hiN cultures treated with levetiracetam (1 1M) or vehicle-only. 
Insets show high-magnification images for visualization of BACE1 and APP 
co-localization. Means = s.e.m. are presented (c). n = 40 (in 4 independent 
wells), 46 (6 independent wells), 35 (3 independent wells) and 48 

(6 independent wells) cells in the groups E3 plus vehicle, E3 plus levetiracetam, 
E4 plus vehicle and E4 plus levetiracetam, respectively. *P < 0.05 by ANOVA 
followed by post-hoc analysis with Tukey HSD test. Scale bar, 5 jum. 


non-carriers, but not in APOE4 carriers (Bonferroni corrected P 
values = 2.93 X 10 * and 6.03 X 10°, respectively; Fig. 5a, b, Sup- 
plementary Fig. 17a and Supplementary Tables 2-4), consistent with a 
genetic interaction between these DCA hits and APOE4. More 
broadly, the set of genetic loci that encode DCA ‘hits’ was found to 
be enriched for genetic variants modulating LOAD age-of-onset in an 
APOE4-dependent fashion, as quantified using a gene-set enrichment 
algorithm (Supplementary Fig. 18 and ref. 38). We note that this was 
not the case for genes that were identified by simple differential gene 
expression analysis only, supporting the notion that DCA more effi- 
ciently identifies genes with a causal role. 

Given the role of RNF219 in APP processing in vitro, we next hypo- 
thesized that genetic variants at RNF219 that are associated with 
LOAD, such as rs2248663, may also affect central nervous system AB 
accumulation in vivo even in unaffected carriers. To this end, we ana- 
lysed Florbetapir ('*F) positron emission tomography (PET) imaging, 
which quantifies central nervous system AB amyloid load”, in a data 
set of 206 genotyped unaffected elderly individuals (cognitively intact; 
within the Alzheimer’s Disease Neuroimaging Initiative, ADNI°””’). 
As previously described*, APOEF4 carrier status was strongly associ- 
ated with increased AB amyloid load across multiple brain regions 
including cingulate, frontal, parietal and temporal cortex. Strikingly, 
the RNF219 rs2248663 minor allele was similarly associated with increa- 
sed AB amyloid load within many brain regions including cingulate, 
frontal, parietal and temporal cortex and brainstem, but not cerebel- 
lum (n = 177; Supplementary Table 5). When stratified by APOE geno- 
type, the RNF219 genotype significantly affected Af load in unaffected 
APOE4 non-carriers (€3/¢3; n = 129; Fig. 5d). Such an association was 
not apparent in APOE4 carriers (¢3/¢4; n = 44), but could not be entirely 
excluded due to the more limited power of analysis in the APOE4 carrier 
subset. In contrast to such positive findings in unaffected cognitively 
intact individuals, AB amyloid load was not significantly associated 
with RNF219 genotype in affected individuals diagnosed with mild 
cognitive impairment (MCI; n = 382 total from the ADNI cohort”, 
Supplementary Table 5), regardless of APOE genotype. This is con- 
sistent with a selective role for RNF219 that precedes the onset of 
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Figure 5 | An RNF219 polymorphism modifies LOAD age-of-onset and 
amyloid deposition in human brain. a, Manhattan plot of the RNF219 locus 
presenting the P value (log-scale, on x-axis) of the interaction between APOE4 
and individual SNPs proximal to the the RNF219 gene (in blue, vertical bars 
represent exons) in the LOAD age-of-onset meta-analysis. b, Mean LOAD 
age-of-onset across the TGEN, GenADA and ADNI data sets. Error bars 
represent s.e.m.; 1 = 339, 67, 530 and 102 individuals for the groups: non 
APOEF4 carrier with RNF219 rs2248663(A); non APOF4 carrier with RNF219 
1s2248663(G); APOE4 carrier with RNF219 rs2248663(A); and APOE4 carrier 
with RNF219 1s2248663(G). **P < 0.01 by ANOVA followed by Tukey HSD. 
c-e, Mean AB amyloid load in cognitively intact elderly individuals from the 
ADNI cohort, as quantified by florbetapir-PET within cingulate cortex 

(c), parietal cortex (d) or cerebellum (e). n = 129, 44, 41 and 7 individuals for 
the groups APOE3 carrier with RNF219 1s2248663(A); APOE3 carrier with 
RNEF219 1s2248663(G); APOE4 carrier with RNF219 rs2248663(A); and APOE4 
carrier with RNF219 rs2248663(G), respectively. Error bars represent 
s.e.m.;*P < 0.05, ***P < 0.001 by ANOVA followed by Tukey HSD. 


LOAD pathology. Taken together, these findings support a genetic 
interaction between RNF219 and APOE in the context of AB amyloid 
load in LOAD-unaffected human brain, as well as with respect to 
LOAD age-of-onset. 


Discussion 


Using whole-transcriptome analysis of brain gene expression as a start- 
ing point, we describe molecular correlates of a prodromal pre-LOAD 
state present even in the context of unaffected APOE4 carriers. Such a 
prodromal molecular signature is consistent with reports that, even in 
LOAD-unaffected individuals, APOE4 is nonetheless associated with 
increased fibrillar AB accumulation in the brain*', decreased cerebro- 
spinal fluid AB42 levels*, and accelerated cognitive decline*’. As most 
APOE4 carriers do not go on to develop LOAD, it is likely that addi- 
tional unknown ‘second hits’ are at play. 

The candidate mediators identified by DCA herein show limited 
overlap with prior LOAD transcriptome network analyses, perhaps 
reflecting differences in experimental design'’***”’. For instance, most 
prior studies have used co-expression approaches primarily to iden- 
tify clusters of functionally related genes, and subsequently compared 
the aggregated expression levels of such functional clusters in patient 
versus unaffected tissue. In contrast, DCA herein was used to identify 
individual candidate node genes that are highly altered in their patterns 
of co-expression with all other transcripts in affected tissue. Furthermore, 
we specifically pursued mechanisms that underlie the early causal events 
in LOAD, that would be apparent even in unaffected APOE4 carriers 
at high risk for LOAD; in contrast, prior network models have often pur- 
sued co-expression changes that correspond to disease progression”. 

Levetiracetam was recently shown to improve cognition and reduce 
hippocampal hyperactivity in individuals with amnestic mild cognitive 
impairment (aMCI)**. Such increased hippocampal activation has also 
been described in asymptomatic individuals who carry the APOE4 
allele**. Levetiracetam treatment of APP transgenic mice has similarly 
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been reported to improve cognitive function, whereas other antiepilep- 
tic agents fail to do so”. We note that in the absence of transgenic human 
APOE4, levetiracetam treatment did not modify APP processing in 
mice”, consistent with the in vitro findings herein. Although the 
therapeutic action of levetirecatem in patients with aMCI was ascribed 
to a general suppression of neuronal hyperactivity’, our data point to 
a more selective molecular mechanism of action for levetirecatem at 
SV2A. More broadly, it will be of interest to apply the integrative 
genomics approaches, as described here, to additional neurological 
and psychiatric brain disorders. 


METHODS SUMMARY 


Differential expression and co-expression analysis were done using the R Biocon- 
ductor package essentially as described previously”. For knockdown experiments, 
N2a cells stably expressing human wild-type APP (N2a-APP) were transfected in 
Opti-MEM1 (Invitrogen) media containing 3 1 of Lipofectamine 2000 (Invitrogen) 
and an appropriate shRNA expression plasmid (1 jg). Cells were subsequently 
maintained in DMEM supplemented with 10% FBS, 500 pg ml puromycin (A.G. 
Scientific) and 500 jig ml’ geneticin (Invitrogen) for at least 1 week before phe- 
notypic studies. AB quantification by enzyme-linked immunosorbent assay (ELISA) 
and APP-BACE1 ICC were performed as previously described**. Levels of APP 
holoprotein and BACE1 enzyme at the cell surface were quantified by cell surface 
biotinylation, fractionation and isolation with Avidin beads, using a Cell Surface 
Protein Isolation Kit (Pierce) followed by western blot. Human skin fibroblast cul- 
tures were obtained from de-identified, banked tissue samples (STC0022: Columbia 
University Institutional Review Board, IRB #AAAD3566; Primary Investigator 
Lawrence S. Honig; T-4560: Columbia University Taub Institute New York Brain 
Bank)**. Neurons were induced from the fibroblast cultures by transduction with 
replication-incompetent lentiviral particles encoding the neurogenic factors Ascl1, 
Brn2, Zicl and Mytll, in fibroblast media containing polybrene (8 1g ml ')**. 
hiNs were then cultured in glial-conditioned N2 media containing 20ng ml! 
BDNF (Peprotech) and supplemented with dorsomorphin (1 1M; Stemgent) for 
4-6 days. Age-of-onset genetic analysis were done by linear regression of age-of- 
onset as a function of gender, APOE status and the queried genotype, allowing for 
an interaction term between APOE and the queried SNP minor allele load using 
gPLINK®™ software. For human Af load analysis, Florbetapir (AV-45) PET-scan 
data was obtained from the ADNI consortium (https://ida.loni.ucla.edu), quanti- 
fied as standardized uptake value ratio (SUVR)”. The effect of rs2248663 or APOE 
genotype on SUVR was evaluated using an additive model within the PLINK 
linear function, and adjusted for gender and age. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 

Materials. Levetiracetam was obtained from LKT Laboratories, puromycin was 
purchased from AG Scientific. Recombinant human APOE2 (rhAPOE2), APOE3 
(rhAPOE3), and APOF4 (rhAPOE4) were from BioVision. Antibodies obtained 
were anti-APP C terminus and anti-RNF219 (both rabbit polyclonals, Sigma- 
Aldrich), anti- APP N-terminal (mouse monoclonal, 22C11, Millipore), anti-APP 
C-terminal (mouse monoclonal, 2.F2.19B4; from Millipore, for detection of the 
exogenous human wild-type APP transgene and cleavage products), Anti-APP 
C-terminal (mouse monoclonal, c1/6.1; Covance, for detection of the endogenous 
murine APP and cleavage products), anti-beta amyloid (mouse monoclonals, 4G8, 
6E10; Covance), anti-BACE1 (rabbit polyclonal; Invitrogen) and anti-RNF219 
(rabbit polyclonal; Santa Cruz Biotechnology). 

Cell culture and shRNA expression vector transfection. N2a cells stably expres- 
sing human APP wild type (N2a-APP) were maintained in DMEM supplemented 
with 10% FBS and 1% penicillin 1% streptomycin and 500-1,000 1g ml! geneti- 
cin (Invitrogen). For knockdown experiments, cells grown overnight to 70-90% 
confluency in 24-well plates were transfected in Opti-MEM1 (Invitrogen) media con- 
taining 3 pl of Lipofectamine 2000 (Invitrogen) and 1 jig of shRNA expressing plas- 
mid DNA and incubated in DMEM supplemented with 10% FBS and 500 pg ml? 
puromycin (A.G. Scientific) and 500 pg ml! geneticin (Invitrogen) at least for 
1 week before further experiments. Validated shRNA plasmids were obtained from 
Open biosytems (pGIPZ backbone): SV2A (#1:V3LMM_450995, 5’-TATTGTGC 
AGGAACGTGCT-3'; #2: V3LMM_450996, 5’-TGATGAACGTGCAGTTGCG-3’; 
#3:V3LMM_451000, 5’-TGCGGAAGAATGTGTTGCT-3’), RNF219 (V3LMM_ 
483077, 5’-ATCACTTCGTTCTAGAGCT-3’) or Sigma (pLKO backbone): NCDN 
(TRCN0000119418, 5'’-CGTAGGATCTTTGATGCCGTT-3’), CALU (TRCN00 
00114827, 5'-GCTCAGCGATAAAGTTCACAA-3’), FYN (TRCN0000023379, 
5'-GCTCGGTTGATTGAAGACAAT-3’), TMEM95L (TRCN0000178059, 5'- 
GCTTATTGACATTCCTCACAT-3’), MAPK3 (TRCN0000023186, 5’-CCATGA 
GAATGTTATAGGCAT-3’), HDLBP (TRCN0000105171, 5'-GCTCGCATTAA 
GAAGATTTAT-3’), RNF219 (TRCN0000125455, 5’-CCACGAGATGAGTGA 
AGATTT-3’), ROGDI (TRCN0000012233, 5’-CCCTCCTCATTCCCTGTGGTA-3’). 
AB ELISA. AB quantification was performed by ELISA as described previously***’. 
Media was conditioned for 48 h before harvesting. Samples were analysed for AB40 
or AB42 using specific sandwich ELISAs. Briefly, AB40, and AB42 were captured 
using monoclonal antibodies targeted against amino acids 35-40 (HJ2.0), or 33-42 
(HJ7.4) of AB, respectively. The antibodies HJ2.0, HJ5.1 and HJ7.4 were gifts from 
D. M. Holtzman. For AB40 and AB42 assays, a biotinylated central domain mono- 
clonal antibody (HJ5.1) followed by streptavidin-poly-HRP-40 was used for detec- 
tion (Sigma). All assays were developed using Super Slow ELISA TMB (Sigma) and 
read on a VersaMax ELISA Microplate Reader (Molecular Devices) at 650nm. 
ELISA signals were reported as the mean ~ s.e.m. of three replica wells in ng of AB 
per ml of supernatant, based on standard curves using synthetic human and mouse 
recombinant AB40 and AB42 peptides (rPeptide; Bogart, GA). Samples were opti- 
mized to detect AB40 and AB42 in the range of 1-3,000 and 0.03-30 ng ml}, 
respectively’. The amount of AB was normalized to total cell protein levels. 
Quantitative real time RT-PCR. qRT-PCR was done as described previously”. 
mRNA levels were quantified using AACt methods, using GAPDH as a normali- 
zing factors. Forward/reverse primers pairs used were 5'-AGGTCGGTGTGAA 
CGGATTTG-3'/5'-GGGGTCGTTGATGGCAACA-3’ for GAPDH, 5'-GTGCCA 
CATTTGCTTGGGG-3'/5'-TCAGGAGTGATCGGGACTCTG-3’ for RNF219, 
5'-AGGTGAAAGGGACTCTGACTC-3'/5'-GCCAAGTGCAACAACTGGT-3’ 
for ROGDI, 5'-AATGCTGATGGGTTCATTGATCT-3'/5'-GGTTCTTATCTC 
GAAACTCCACG-3’ for CALU, 5'-ACAGTGCAGACAGAGATCCAG-3'/5'- 
CGGATTTGAAGGTCATACCTCCA-3’ for PTK2B, 5'-CTCAGGTGTTTCAT 
GTACCCC-3'/5'-TCAAGGCAGATTTTTGCTTGTTC-3’ for HDLBP, 5'-GGT 
TCACCTTTCCCAACCGA-3'/5'-GCTGGCTAGTTCAGGGTCG-3’ for NCDN 
and 5'-TCCGCCATGAGAATGTTATAGGC-3’/5'-GGTGGTGTTGATAAGC 
AGATTGG-3’ for MAPK3. 

Immunohistochemistry. Immunocytochemistry (ICC) was performed as prev- 
iously described**. Cells were fixed with 4% paraformaldehyde for 10 min at room 
temperature, followed by rinsing 3 times with phosphate-buffered saline (PBS). 
Cells were then permeabilized with 0.3% Triton X-100 in 1X PBS for 2 min at 
room temperature. After rinsing three times with PBS, cells were incubated with 
blocking buffer containing 1% BSA and 1% normal goat serum at room tempera- 
ture for 1h. All primary antibodies were diluted in blocking buffer. Cells were 
incubated with primary antibodies (1:200 to 1,500) at 4 °C for 12-16 h, followed 
by the corresponding secondary antibody solutions at room temperature for 1h. 
Cells were rinsed with 1X PBS three times followed by mounting on coverslips 
with Fluoro-Gel (Electron Microscopy Sciences). Detailed antibody sources and 
dilutions used can be found in the Supplementary Information. Imaging was 
conducted by laser-scanning confocal microscopy with a X63 (1.4 NA) objective 
(LSM510, Carl Zeiss). Cell counts and fluorescence intensities were quantified in 
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10 to 35 images of randomly selected views per well. Subsequently, images were 
analysed for cell counts and fluorescent intensity using Image J 1.42q software 
(National Institutes of Health). 

Immunoblotting. Cells were suspended in RIPA buffer contained protease inhi- 
bitor cocktail (Sigma) and sonicated, then cells were incubated for 1 h at 4 °C. The 
lysates were cleared by centrifugation at 10,000g for 10 min at 4°C. Protein con- 
centration was determined by Bio-Rad Dc Protein assay kits (Bio-Rad), 5-30 pg 
of total protein lysate was resolved on a 4-12% SDS-PAGE gel. Protein samples 
were transferred onto nitrocellulose membranes using the semi-dry transfer unit 
(Owl Scientific) and blocked with 3% skimmed milk in Tris buffered saline with 
0.1% Tween 20 for 1h. The primary antibodies were incubated overnight at 4°C, 
and primary antibody binding was detected using horseradish peroxidase con- 
jugated anti-mouse IgG at 1:5,000 dilution (Jackson Immunoresearch Labora- 
tories) or horseradish peroxides conjugated anti-rabbit IgG at 1:5,000 dilution 
(Jackson Immunoresearch Laboratories). 

Cell surface biotinylation. Levels of APP holoprotein and BACE1 at the cell 
surface were quantified using cell surface biotinylation followed by fractionation 
and western blotting. Cell surface biotinylation was performed using a cell surface 
protein isolation kit following the manufacturer’s instructions (Pierce). N2a cell 
cultures in two 10-cm dishes at 90-95% confluence were first rinsed three times 
with ice-cold culture medium and then incubated in PBS containing 1 mg ml * of 
EZ-link Sulfo NHS-LC biotin (Pierce Chemical), or PBS alone, for 30 min at 4 °C. 
To quench any excess biotin, cell cultures were rinsed and then incubated in 
quenching solution for 10 min at 4 °C. Next, cells were gently scraped into conical 
tubes and centrifuged. Pellets were washed in cold PBS three times, and lysis 
buffer with protease inhibitor at 4°C added as per manufacturer’s instructions. 
Cells were sonicated for disruption and incubated 30 min on ice. After incubation, 
sample were centrifuged 10,000g for 2 min and supernatants were collected. Cell 
lysates from biotin-labelled and unlabelled samples were incubated with NeutrAvidin- 
agarose beads (Pierce Chemical) for 60 min at room temperature. Subsequently, 
beads were washed with RIPA buffer containing protease inhibitor cocktail (Sigma), 
and total protein levels were determined using a BCA protein quantification assay 
kit (Bio-Rad). For western blot analysis, lysates were mixed with sample buffer, 
boiled for 10 min, and then resolved on an SDS-PAGE gel. Total protein, contain- 
ing 15 pg of lysate was simultaneously resolved on the gel, transferred to PVDF 
membrane and probed for APP and BACE] expression. 

Human skin fibroblasts. Human skin fibroblast cultures used in this study were 
obtained from de-identified, banked tissue samples; there was no interaction with 
subjects, no intervention, and private, identifiable information was not collected. 
One fibroblast line was obtained from an unaffected individual homozygous for 
the APOE3 allele (STC0022 (female, 65 years old)), and a second line from an 
unaffected individual who carries one APOE4 and one APOE3 allele (T-4560 
(male, 89+ years old)). STC0022 and T-4560 were obtained from de-identified, 
banked tissue samples (STC0022: Columbia University Institutional Review Board, 
IRB #AAAD3566; Primary Investigator. Lawrence S. Honig; T-4560: Columbia 
University Taub Institute New York Brain Bank)*. Human skin fibroblasts were 
cultured in standard fibroblast media (DMEM with 10% FBS). 

hiN cell induction and transfection. Fibroblasts were plated at 20,000 cells per 
well in 24-well plates one day before infection. Culture plates and dishes were 
treated with poly-L-ornithine (Sigma) and laminin (Invitrogen) or poly-p lysine 
(Trevigen) and laminin before the application of the cells as per the manufac- 
turer’s instructions. Fibroblasts were transduced with replication-incompetent, 
VSV¢g-coated lentiviral particles encoding Ascll, Brn2, Zic 1 and Mytll, in fibro- 
blast media containing polybrene (8 1g ml’). Each lentiviral type was added at a 
multiplicity of infection ~2:1. Two day after transduction, the media was replaced 
with glial-conditioned N2 media containing 2 ng ml_' BDNF (Peprotech). For the 
first 4-6 days in N2 media, dorsomorphin (1 1M; Stemgent) was applied to the 
culture. Media was changed every 2-3 days for the duration of the culture period. 
Differential expression. The microarray data set was downloaded from the Myers 
laboratory website. All subsequent data manipulations and analyses were done 
using R Bioconductor package. For each gene, differential expression between two 
conditions was assessed by a two-tailed t-test, using a threshold P-value of 0.05. 
For a gene to be considered affected by both APOE genotype and AD phenotype, 
its expression level has to be significantly changed in the same direction when 
comparing samples from unaffected APOE4 individuals to unaffected APOE2 
individuals and when comparing samples from AD APOE3 patients to unaffected 
APOEB3 individuals. Genes annotation enrichment were queried using GSEA* or 
the DAVID bioinformatics resources***». 

Differential wiring analysis. The microarray data set was downloaded from the 
Myers laboratory website. All subsequent data manipulations and analyses were 
done using R Bioconductor package. Correlations between gene expression levels 
were assessed using cosine similarity on log-transformed levels; briefly, two genes 
whose expression levels are simultaneously high or low across many samples are 
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in phase and will have a correlation coefficient close to 1. On the contrary, if one 
gene shows high expression levels when another one shows low across many 
samples, those two genes are in anti-phase and will have a correlation coefficient 
close to —1. The absence of linear relationship between the expression levels of 
both genes will result in a correlation coefficient close to 0. Comparisons between 
correlations obtained in two independent groups were done using a Fisher’s Z 
transformation followed by a statistical test using pnorm R function. 

The principle underlying DW algorithms'*” is that for a given candidate ‘mas- 
ter regulator’ node gene X, the global DW score—when comparing two experi- 
mental conditions 1 and 2—is the sum of DW subscores between gene X and each 
of the other genes Gi queried. The subscore between the gene of interest X (for 
which the DW score is calculated) and a gene Gi is proportional to: (1) the extent 
of the shift in correlation between the expression levels of Gi and X when com- 
paring conditions 2 and 1 (thus genes exhibiting a high number of strong shifts in 
correlation with many other genes are assumed to be relevant nodes in the dif- 
ferential gene expression network between conditions 1 and 2); (2) the extent of 
differential expression of Gi between conditions 1 and 2 (averaged across the panel 
of samples for each condition; thus, the more a gene is on average differentially 
expressed between 2 conditions, the more it is predicted to have a phenotypic 
impact); (3) the level of expression Gi (a more highly expressed gene is thought to 
have a higher phenotypic impact; this is to compensate for the fact that lowly 
expressed genes are more likely to exhibit strong shifts in expression between the 
two conditions). 

The three main modifications we introduce to the previously described wiring 
algorithms'*”* are: (1) We broadened the analysis of possible ‘master regulator’ 
genes from only annotated transcription factors to all genes. (2) We introduced 
significance threshold tests for the interactor genes: as we included all the genes as 
candidate ‘master disease regulators’, instead of only all the annotated transcrip- 
tion factors we wanted to avoid artificial results when working at a genome-wide 
scale than with hundreds of selected genes. Low-selective threshold (P value = 0.05) 
were however chosen to keep a high sensitivity. (3) We considered for differential 
expression as well as for differential correlation the intersection between 2 com- 
parisons involving 4 independent groups. For a gene to be considered as differen- 
tially expressed, it has to be significantly differentially expressed in both conditions 
with the same orientation. 

The differential wiring score for a gene X between for two comparisons each 
between two independent experimental groups (1 and 2 with n, and n, elements 
and A and B with n, and ng elements, respectively) was thus calculated as the sum 
over all the genes Gi of the absolute value of the product of: (1) The conditional 
Z-distance evaluating the difference observed between the two groups in each of 
the two comparisons for the correlation between the expression levels of genes X 
and Gi ( Agicivs2) A AB) in the formal DW formula below. Thus, for a given 


threshold P value (0.05 here), it has a null value if the correlation shift is not sig- 
nificant. The amplitude of the Z-distance is proportional to the shift in correlation 
between the two experimental conditions. Fisher’s Z-transformation corrects for 
the non-normal distribution of the correlation value (between — 1 and 1). Asa con- 
sequence, a shift in correlation form 0.7 to 0.9 will lead a Z-distance value higher 
than a shift from —0.1 to 0.1. (2) The conditional log-scaled amplitude of the diffe- 
rential expression of gene Gi (90x Giver Av) in the DW formula below. For 
Pp 


a given threshold P value (0.05 here), it has a null value if the gene is found to not 
be differentially expressed between the two conditions. If the gene is differentially 
expressed for the chosen P value, the value will be the log of the ratio between the 
averaged gene expression levels in each group. (3) The averaged expression level of 
gene Gi among all samples ((Eg, ),, )( )a\yp in the formula below. 

As a consequence of the use of significance threshold tests, only those genes 
which are differentially expressed between the two experimental conditions, and 
that see their correlation with gene X significantly changed between the two expe- 
rimental conditions, will participate in the DW score. 

Formally, the DW score was thus calculated as: 

Differential wiring score for gene X: 


DW(X)ivs2 = D> Gi (Aciawa)nass)) | (8K.Gawarvvem) Ie, 1U2UAUB 


With: 
Conditional Z-distance for a p-value p: 


Voi Sawp LF pnorm(d1ys2) <p 
AND pnorm(Saysp) <p 
(Bra) p= AND 81yq2"5 
lvs2 SAvsB 
0 OTHERWISE 


F,(1(X,G),-F,(1(X, ; 
EGF CG.) Z-distance between r(X,G),and r(X,G), 


5(X,G)iys2 1 1 
nh) — 3 m= 3 
F,(1(X. =F, (r(X. 
5(XG) avon a(r( 2s) alr i Gs) Z-distance between r(X,G), and r(X,G), 
nha= 3 ng — 3 


r(X,G),, r(X, G)2, r(XG) a, r(X G), correlation coefficient between the expression 
levels of genes X and G, evaluated in experimental groups 1 (n, elements), 2 (n2 
elements), A (ng elements) B (ng elements). 


1l+r 
FZ(r) = jloe( ;) Bsher s z transformation for a correlation coefficient r 


Conditional differential gene expression amplitude of Gi 


(Agjcavs2)\(AvsB) a = 


bs (=) (=) IF p, value (t,test((Eg) ,.(Ec pee »)) <p 


a) \Eoe AND p, value (t,test((Eg),,(Ec)3)) <p 
(Eg); (Ea) a 
AND log .log >0 
(Ec)2 (Ea)p 
0 OTHERWISE 


(Eg): collection of the expression level values for gene G among an experimental 
group i. All calculations were performed using the R statistical environment. 
GWAS epistasis analysis. We used 4 publicly available GWAS data sets; GenADA 
study was downloaded from NCBI’s dbGap repository”’ (Multi-Site Collaborative 
Study for Genotype-Phenotype Associations in Alzheimer’s disease and Longi- 
tudinal follow-up of Genotype-Phenotype Associations in Alzheimer’s disease 
and Neuroimaging component of Genotype-Phenotype Associations in Alzheimer’s 
disease, dbGap phs000219.v1.p1), which includes 875 AD cases and 850 controls*. 
Two Tgen neuropathological data sets were obtained from the Translational 
Genomics Research Institute (http://www.tgen.org). The ‘discovery cohort’ 
includes 446 AD cases and 290 controls, the ‘replication cohort’ 197 AD cases 
and 114 controls**. The ADNI data sets were downloaded from the Alzheimer’s 
Disease Neuroimaging Initiative website (https://ida.loni.ucla.edu) and includes 
180 AD cases and 214 controls”’. All subsequent genetic analysis were done using 
gPLINK” software. Interaction between different SNPs and APOF4 in each data 
set were evaluated using PLINK linear and interaction functions, applied to the 
following model AoO = a+ b.APOE + c.Sex + d.SNP + e. SNPxAPOEbin, where 
AoO is the age-of-onset of Alzheimer’s disease, APOE is the allelic load of APOE 4 
allele (= 0, 1 or 2), Sex is the gender ( = 0 or 1), SNP the allelic load of the minor 
allele for the studied SNP (= 0, 1 or 2) and APOEbin a binary indicator for the 
presence or absence of at least one APOF4 allele (= 0 or 1). The term e is con- 
sidered as the interaction term in the current analysis. Rare APOE2 allele carriers 
were not considered in the present study. Analysis was performed independently 
in each data set, and then combined in a meta-analysis using METAL (http:// 
www.sph.umich.edu/csg/abecasis/metal/). 

AD age-of-onset GWAS set enrichment analysis. GWAS enrichment analysis 
was performed essentially as previously described”*, using the P value resulting 
from the APOE4-dependent AD age-of-onset association meta-analysis described 
above. For each gene present in the gene expression data set the associated P value 
corresponds to the lowest meta-analysis P value for a SNP located within 50-kb on 
either side of the gene, using the 286,178 SNPs present in at least 3 of the 4 studies 
used in the meta-analysis. A gene set enrichment score was calculated for the 
different groups of genes considered using a weighted Kolmogorov-Smirnov-like 
running-sum statistic as previously described**. For each gene set, a null distri- 
bution was generated by permuting 10,000 times the gene labels, and was used to 
evaluate an empirical P value for the enrichment score. All calculations were made 
using R statistical software. 

Florbetapir (AV-45) PET-scan epistasis analysis. Data was downloaded from 
the Alzheimer’s Disease Neuroimaging Initiative website (https://ida.loni.ucla.edu) 
and includes data from 177 cognitively intact and 338 mildly cognitively impaired 
elderly genotyped individuals. Briefly, AB load was evaluated by florbetapir-PET 
and quantified as a standardized uptake value ratio (SUVR)”° in different brain 
regions (parietal cortex, cingulate cortex, frontal cortex, temporal cortex, brain- 
stem and cerebellum). The effect of rs2248663 minor allele (G) or APOE4 allele 
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load on the SUVR was evaluated for each group using an additive model within the 


PLINK linear function, and adjusted for gender and age. 
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metal-rich and a metal-poor globular cluster 


B. M. S. Hansen’, J. S. Kalirai?*, J. Anderson”, A. Dotter*, H. B. Richer’, R. M. Rich’, M. M. Shara®, G. G. Fahlman’, J. R. Hurley’, 
1. i 


R. King’, D. Reitzel’ & P. B. Stetson’ 


Globular clusters trace the formation history of the spheroidal 
components of our Galaxy and other galaxies’, which represent 
the bulk of star formation over the history of the Universe’. The 
clusters exhibit a range of metallicities (abundances of elements 
heavier than helium), with metal-poor clusters dominating the 
stellar halo of the Galaxy, and higher-metallicity clusters found 
within the inner Galaxy, associated with the stellar bulge, or the 
thick disk**. Age differences between these clusters can indicate the 
sequence in which the components of the Galaxy formed, and in 
particular which clusters were formed outside the Galaxy and were 
later engulfed along with their original host galaxies, and which 
were formed within it. Here we report an absolute age of 9.9 + 0.7 
billion years (at 95 per cent confidence) for the metal-rich globular 
cluster 47 Tucanae, determined by modelling the properties of the 
cluster’s white-dwarf cooling sequence. This is about two billion 
years younger than has been inferred for the metal-poor cluster 
NGC 6397 from the same models, and provides quantitative evid- 
ence that metal-rich clusters like 47 Tucanae formed later than 
metal-poor halo clusters like NGC 6397. 

The ages of globular clusters are most easily determined by model- 
ling the properties of main-sequence and horizontal-branch stars*’, 
but doing so leads to uncertainties caused by covariance between the 
measured age and the spectroscopic metallicity required to compare 
observed colours and magnitudes to theoretical models. The metals in 
the atmospheres of white dwarfs sediment out because of the strong 
stellar gravity, resulting in an age determination more robust with 
respect to metallicity correlations. In such cases, the age is based on 
the monotonic cooling of white dwarfs, because the luminosity func- 
tion peak moves to fainter magnitudes for older populations. The 
consistency of the two approaches has been demonstrated with the 
determination of absolute ages for the globular clusters Messier 4 and 
NGC 6397 (refs 10-13). Our attempt to extend the application of this 
method (fitting the cooling sequence of the white dwarfs) to the metal- 
rich cluster 47 Tucanae is driven by the fact that the covariance between 
metallicity and age in the main-sequence fitting method becomes stron- 
ger with increasing metallicity, indicating the need for an independent 
determination. 

We observed a field at roughly the half-mass radius of the cluster 
with the Hubble Space Telescope. The primary science observations 
were performed with the Advanced Camera for Surveys, with observa- 
tions split between the wide F606W (total integration of 163.7 ks) and 
F814W (total integration of 172.8 ks) bandpasses. The field was also 
chosen to overlap with prior Hubble Space Telescope observations, so 
that the cluster motion results in positional shifts of cluster members 
with respect to background stars, and so cluster and non-cluster mem- 
bers can be separated. Details of the photometry, completeness and 
measurement of the proper motions is described in the Supplementary 
Information. 


The white-dwarf population is isolated by both colour and proper 
motion, as shown in Fig. 1. The distribution of white dwarfs with lumin- 
osity (the luminosity function) can be compared (Fig. 2) with the white- 
dwarf luminosity function of a metal-poor, globular cluster with deep 
Hubble Space Telescope observations, called NGC 6397 (ref. 12). The 
abrupt truncation of these luminosity functions is a manifestation of the 
finite age of a population, as white dwarfs cool monotonically, and this 
cutoff moves to fainter magnitudes as a population ages. In Fig. 2 we have 
adjusted the magnitudes of the NGC 6397 population to correct for the 
different cluster distances, and so placed it on the same magnitude scale 
as the 47 Tucanae population. The 47 Tucanae luminosity function 
drops to half of the peak value at approximately 0.4 magnitudes brighter 
than that of NGC 6397, a direct demonstration that this cluster is 
younger. A comparison to a third cluster, Messier 4 (ref. 13), shows a 
similar age difference. 
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Figure 1 | The 47 Tucanae white-dwarf cooling sequence. The filled black 
points on the magnitude-colour plot are the proper-motion-selected white 
dwarfs of 47 Tucanae, and the small green points are those objects that fall 
outside the proper-motion cut. At brighter magnitudes (redder colours) these 
are members of the main-sequence population of the Small Magellanic Cloud, 
whereas at fainter magnitudes (bluer colours) they are mostly background 
galaxies. The grid outlined in red shows the binning used to compare the 
observations to the Monte Carlo populations. 
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Figure 2 | Luminosity function comparison between 47 Tucanae and 
NGC6397. The solid points and histogram show the white-dwarf luminosity 
function for 47 Tucanae, while the open circles and dashed histogram is the 
corresponding quantity for NGC 6397 (ref. 12), with error bars based on 
Poisson statistics. Each shows the characteristic behaviour of populations that 
result from a single burst of star formation, namely a sharp rise followed by a 
relatively abrupt truncation. In the case of NGC 6397 data, we have applied a 
horizontal shift of 0.85 magnitudes to correct for the different cluster distances 
and put them on the same magnitude scale, and the vertical scaling has been 
adjusted by a global multiplicative factor of 2.9 to correct for differences in the 
absolute numbers of the different populations. We see that the 47 Tucanae 
luminosity function clearly truncates at a brighter luminosity than that of NGC 
6397, indicating that this cluster is younger. 


To determine a quantitative measure of the age difference, we con- 
struct Monte Carlo models for the cooling of the cluster white-dwarf 
population, including a model for photometric shifts, scatter and 
incompleteness, based on artificial star tests. We fit these models, 
including the uncertainties in distance and extinction, to both the 
observed luminosity function and the two-dimensional distribution 
of the white-dwarf population in colour and magnitude (the Hess 
diagram). If we use the same models to fit the 47 Tucanae data as we 
did for NGC 6397 (ref. 12), we obtain an age of 9.7 + 0.4 billion years 
(Gyr) (at 95% confidence) for 47 Tucanae, compared to the corres- 
ponding value of 11.7 + 0.3 Gyr for NGC 6397. Therefore, the relative 
age difference is 2.0 + 0.5 Gyr. 

The cluster 47 Tucanae has a metallicity'*'° of [Fe/H] = -0.75, com- 
pared to [Fe/H] = -1.8 for NGC 6397. Although white-dwarf atmo- 
spheres are not sensitive to overall metallicity, the cooling may be 
affected by changes in the nuclear burning history owing to different 
metallicities. To quantify any difference introduced by variations in 
progenitor behaviour, we have calculated new evolutionary models 
using progenitors of appropriate metallicity for 47 Tucanae. Using 
the MESA code!’, we have calculated the evolution of solar-mass stars 
with [Fe/H] = -0.75 and [«/Fe] = +0.2, as well as with a range of helium 
enrichment Y (see Supplementary Information) seen in 47 Tucanae™. 
We have also verified that these calculations produce similar results to 
the models used to determine the age of the cluster from the main 
sequence””, to facilitate comparison with those earlier analyses. We find 
that the final age determination is not sensitive to either progenitor 
metallicity, helium fraction or which of the Schwarzschild or Ledoux 
criteria for convection is assumed when calculating the mixing of the 
progenitor core. Neither is the result sensitive to the use of different 
atmospheric models for correcting effective temperatures to observed 
colours. Details of this analysis are discussed in the Supplementary 
Information. Marginalizing over all these models results in an age of 
9.9 + 0.7 Gyr for 47 Tucanae. Figure 3 shows the comparison between 
the best-fit model and the data, binned according to the grid in Fig. 1. 

Estimates based on fitting main-sequence models to the main- 
sequence turnoff report ages between 10 Gyr and 13 Gyr, with the range 


52 | NATURE | VOL 500 |1 AUGUST 2013 


due to a variety of factors including differences in adopted models 
physics (such as nuclear reaction rates and gravitational settling) as well 
as different bandpasses employed in the observational data*”**. The 
most precise age so far obtained from the main sequence of 47 Tucanae 
comes from the detached, eclipsing binary star V69 (ref. 24). V69 
comprises two stars that lie just above and below the main-sequence 
turnoff, meaning that the pair provides a particularly valuable age 
constraint. The resulting age is somewhat sensitive to the assumed 
metallicity of the stars. The original age estimate for V69 (ref. 24) 
was 11.25 + 0.21 + 0.85 Gyr, denoting random and systematic errors, 
respectively. This result is based on the assumption of [Fe/H] = —0.7 
and [«/Fe] = +0.4. Adopting a chemical composition in accordance 
with more recent studies'*"'° and employing a more robust estimate of 
the systematic error implies that the age derived from the binary system 
is 10.39 + 0.54 Gyr, here citing only the systematic error. This estimate 
is fully consistent with the white-dwarf result given above, suggesting 
that consistency between the two age measurement techniques is pos- 
sible, provided we also have accurate determination of extrinsic pro- 
perties such as distance, reddening and metallicity. 

The bimodality in the metallicity distribution of globular clusters 
appears to be ubiquitous among large galaxies’, and is held to reflect 
the sequence in which galaxies at high redshifts assemble their stellar 
mass. It is still disputed whether the two classes represent two distinct 
epochs of stellar assembly within a single dark matter halo, or whether 
some clusters are assembled early in smaller haloes and then later 
accreted’ *’, Within this context, our result supports the idea that there 
is a measurable age difference between metal-poor and metal-rich globu- 
lar clusters in the Milky Way. Figure 4 shows the relationship between 
white-dwarf-based age and metallicity for several Galactic populations. 
Including the uncertainties due to stellar models, 47 Tucanae has an 
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Figure 3 | Model comparison to white-dwarf colour-magnitude 
distribution. In each panel, the points show the number count of the white- 
dwarf population as a function of colour at fixed magnitude. Each panel is 
labelled with the corresponding central F814W magnitude. Filled circles are 
points that are included in the grid in Fig. 1, whereas open circles are not 
included, because they are expected to be dominated by residual contamination 
from background populations. Error bars are Poisson errors. The histogram is 
the model distribution binned in the same way, for an age of 9.7 Gyr and 
distance to the cluster figoew = 13.38 (see Supplementary Information). 
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Figure 4 | Age—-metallicity relation based on white dwarfs. Each point shows 
the age of a Galactic population as determined using the white-dwarf 
luminosity function, and the corresponding metallicity determined from main- 
sequence stars. The age of 47 Tucanae is determined here, while those of NGC 
6397 (ref. 12), Messier 4 (ref. 13) and the open cluster NGC 6791 (ref. 28) are 
taken from the literature. The age of the thin disk is taken from an analysis of 
the white-dwarf population in the solar neighbourhood” and the metallicity 
error bar is determined from the range of metallicities measured for nearby 
stars”. The age uncertainties for all populations are quoted from the given 
references. The redshift axis uses the age-redshift relation appropriate to the 
WMAP (currently accepted) cosmology’. This demonstrates that the age 
difference derived here corresponds to the difference between z ~ 3 and z ~ 2. 


absolute age of 9.9 + 0.7 Gyr, which places its formation at a redshift of 
approximately two. This age is consistent with the model in which the 
metal-rich globular clusters are formed by the massive, star-forming 
events that are responsible for the peak in the global star-formation rate 
at this redshift, whether driven by mergers or in situ formation”. 
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Nanometre-scale thermometry in a living cell 


G. Kucsko!, P. C. Maurer", N. Y. Yao!, M. Kubo?, H. J. Noh®, P. K. Lo*, H. Park!?? & M. D. Lukin! 


Sensitive probing of temperature variations on nanometre scales is 
an outstanding challenge in many areas of modern science and 
technology’. In particular, a thermometer capable of subdegree 
temperature resolution over a large range of temperatures as well 
as integration within a living system could provide a powerful new 
tool in many areas of biological, physical and chemical research. 
Possibilities range from the temperature-induced control of gene 
expression” > and tumour metabolism’ to the cell-selective treat- 
ment of disease”* and the study of heat dissipation in integrated 
circuits’. By combining local light-induced heat sources with sensi- 
tive nanoscale thermometry, it may also be possible to engineer 
biological processes at the subcellular level” >. Here we demonstrate 
a new approach to nanoscale thermometry that uses coherent mani- 
pulation of the electronic spin associated with nitrogen-vacancy 
colour centres in diamond. Our technique makes it possible to detect 
temperature variations as small as 1.8 mK (a sensitivity of 9 mK Hz") 
in an ultrapure bulk diamond sample. Using nitrogen-vacancy centres 
in diamond nanocrystals (nanodiamonds), we directly measure the 
local thermal environment on length scales as short as 200 nano- 
metres. Finally, by introducing both nanodiamonds and gold nano- 
particles into a single human embryonic fibroblast, we demonstrate 


temperature-gradient control and mapping at the subcellular level, 
enabling unique potential applications in life sciences. 

Many promising approaches to local temperature sensing’ are being 
explored at present. These include scanning probe microscopy’, 
Raman spectroscopy’®, and fluorescence-based measurements using 
nanoparticles'*”* and organic dyes'*™*. Fluorescent polymers”* and green 
fluorescent proteins” have recently been used for temperature mapping 
within a living cell. However, many of these existing methods are 
limited by drawbacks such as low sensitivity and systematic errors 
due to fluctuations in the fluorescence rate’’”’, the local chemical 
environment’? and the optical properties of the surrounding medium™. 
Moreover, although promising, methods based on green fluorescent 
proteins rely on cellular transfection’* that proves to be difficult to 
achieve in certain primary cell types’’. Our new approach to nanoscale 
thermometry uses the quantum mechanical spin associated with nitrogen— 
vacancy colour centres in diamond. As illustrated in Fig. 1b, in its 
electronic ground state each nitrogen-vacancy centre constitutes a 
spin-1 system. These spin states can be coherently manipulated using 
microwave pulses and efficiently initialized and detected by means of 
laser illumination (Supplementary Information). In the absence of an 
external magnetic field, the precise value of the transition frequency 
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Figure 1 | Nitrogen-vacancy-based nanoscale thermometry. a, Schematic 
image depicting nanodiamonds (grey diamonds) and a gold nanoparticle 
(yellow sphere) within a living cell (central blue object; others are similar) with 
coplanar waveguide (yellow stripes) in the background. The controlled 
application of local heat is achieved by laser illumination of the gold 
nanoparticle, and nanoscale thermometry is achieved by precision 
spectroscopy of the nitrogen—vacancy spins in the nanodiamonds. b, Simplified 
nitrogen—-vacancy level diagram showing a ground-state spin triplet and an 


excited state. At zero magnetic field, the | +1) sublevels are split from the |0) 
state by a temperature-dependent zero field splitting A(T). Pulsed microwave 
radiation is applied (detuning, 5) to perform Ramsey-type spectroscopy. 

c, Comparison of sensor sizes and temperature accuracies for the nitrogen- 
vacancy quantum thermometer and other reported techniques. Red circles 
indicate methods that are biologically compatible. The open red circle indicates 
the ultimate expected accuracy for our measurement technique in solution 
(Methods). 
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(A) between the |m, = 0) and|m, = +1) states (m,, spin projection) has 
a temperature dependence (d4/dT = —2n X 77 kHzK_') due to ther- 
mally induced lattice strains'®*. 

The operational principle of nitrogen—vacancy-based thermometry 
relies on the accurate measurement of this transition frequency, which 
can be optically detected with high spatial resolution (Fig. 1). For a 
sensor containing N colour centres, the temperature sensitivity is given 
by 

1 1 


1 CdA/aT JT, NE 


where T.,, is the nitrogen-vacancy spin coherence time and t is the 
integration time. Here we also introduce a factor, C, to account for 
imperfections in readout and initialization”’. If T.,), is of the order of a 
few milliseconds and C ~ 0.03 (ref. 19), then a single nitrogen—vacancy 
can potentially exhibit a sensitivity better than 1 mK Hz“. Apart 
from high sensitivity, nitrogen—vacancy-based thermometry also offers 
several distinct advantages over existing methods in biological and 
chemical temperature sensing. First, owing to diamond’s chemical 
inertness, it is generally robust to changes in the local chemical environ- 
ment. Second, it can be applied over a wide range of temperatures 
(200-600 K; refs 17, 18), which is of particular interest in the study 
of nanoscale chemical reactions”. 

As a benchmark experiment, we demonstrate the high temperature 
sensitivity of nitrogen-vacancy-based thermometry in a bulk diamond 
sample. Although the nitrogen—vacancy’s magnetic sensitivity has ren- 
dered it a competitive magnetometer*’”’, to determine the tempera- 
ture accurately it is necessary to decouple the nitrogen—vacancy electronic 
spin from fluctuating external magnetic fields. This is achieved using a 
modified spin-echo sequence that makes use of the spin-1 nature of the 
nitrogen-vacancy defect”*, allowing us to eliminate the effects of an 
external, slowly varying magnetic field. Specifically, we apply a micro- 
wave pulse at frequency w (Fig. 1b) to create a coherent superposition 
(|0) +|B))//2, where |B) =(|+1) +|—1))/V2. After half the total 
evolution time, t, we apply a 27 echo pulse that swaps the population of 
the |+1) and |—1) states (Fig. 2a). Following another period of free 
evolution for time T, quasistatic, magnetic-field-induced shifts of these 
|+1) levels are eliminated, allowing for accurate temperature sensing. 
In the experiment, we use a sample of isotopically pure diamond (99.99% 
spinless '*C isotope) grown by chemical vapour deposition” to further 
reduce magnetic field fluctuations originating from the intrinsic ‘°C 
nuclear spin bath. As shown in Fig. 2a, this allows us to observe coherence 
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Figure 2 | Sensitivity of single nitrogen-vacancy thermometer. a, Measured 
fluorescence as a function of echo evolution time, 2t (red points); the black solid 
line indicates a fit corresponding to a damped cosine function with two distinct 
frequencies. The characteristic beating can be explained by fluctuating 
proximal charge traps separated from the nitrogen-vacancy centre by a 
distance of about 50 nm. Inset, the microwave 27 echo pulse sequence used to 
cancel unwanted external magnetic field fluctuations”. b, Measured 
fluorescence (red points) as a function of temperature for echo times of 
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fringes for up to 0.5 ms. Notably, for all nitrogen—vacancies tested, we 
observe a characteristic low-frequency beating of the fluorescence sig- 
nal that varies from defect to defect, which is most probably due to 
locally fluctuating charge traps”. Despite this beating, for a fixed evolu- 
tion time 27, the nitrogen—vacancy spin depends sensitively on the 
sample temperature (Fig. 2b). We observe a temperature sensitivity 
of 7 =(9 + 1.8) mKHz "? for 2t = 250 us. For 30s of integration, we 
achieve a measurement accuracy of 67 = 1.8 +0.3mK (Methods). 
Although the measurement sequence for a single value of 2t allows 
us to determine the temperature only up toa multiple of (2d4/d7T21) ', 
absolute temperature variations can be determined by repeating the 
measurement for 2t’ < 21, as shown in Fig. 2b. 

We now demonstrate the high spatial resolution of nitrogen—vacancy- 
based thermometry. This is achieved by using nanodiamonds. In most 
commercially available nanodiamonds, the nitrogen-vacancy coher- 
ence time is limited to approximately 1 jis owing to additional para- 
magnetic impurities. Although this shortened coherence time reduces 
the intrinsic temperature sensitivity for a single defect, this decrease 
can be offset by using an ensemble of nitrogen—vacancies to enhance the 
signal-to-noise ratio by a factor of VN. We note that, unlike nitrogen— 
vacancy-based magnetometry, where the proximity to the source 
(often limited by the nanodiamond size) is critical to the maximum 
sensitivity, nitrogen—-vacancy thermometry is not subject to such a 
constraint; in fact, the excellent thermal conductivity of diamond 
ensures that all nitrogen—vacancy centres within a nanocrystal are in 
thermal equilibrium with the local heat environment. To maximize the 
number of nitrogen—vacancy centres and to minimize the lattice strain, 
our measurements are performed on single-crystalline nanodiamonds 
containing approximately 500 nitrogen-vacancy centres (Adamas 
Nanotechnologies). The zero-field splitting, 4, of the nitrogen—vacancy 
ensemble, and, thus, the temperature, is determined by recording a 
continuous-wave electron spin resonance (ESR) spectrum. Specifically, 
we measure changes to the zero-field splitting by recording the fluore- 
scence at four different frequencies centred around A = 27 X 2.87 GHz 
(Fig. 3a). This procedure eliminates unwanted effects from fluctuations 
in the total fluorescence rate, ESR contrast, Rabi frequency and mag- 
netic field, yielding a robust thermometer (Methods). 

Combining our nanodiamond thermometer with the laser heating 
of a gold nanoparticle allows us to both monitor and control temper- 
ature at nanometre length scales (Fig. 3). Both nanodiamonds and gold 
nanoparticles (nominal diameter, 100 nm) are initially spin-coated on a 
microscope coverslip. Using a confocal microscope with two independent 
scanning beams, we co-localize gold nanoparticles and nanodiamonds 
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2t = 250 Ls (bottom) and 2t’ = 50 us (top). Black lines, best fits; errors, lo. The 
fixed evolution times of 2t and 27’ are indicated in a by red arrows. The 
temperature is controlled by a Peltier element at the sample mount, and the 
(local) x-axis temperature is determined using a thermistor located 
immediately next to the sample. The fluorescence is converted to population by 
normalizing to two reference measurements where the spin is prepared with 
m,; = 0 and, respectively, m; = —1. 
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with a resolution of ~100 nm (Supplementary Information). While loc- 
ally heating the gold nanoparticles by continuous illumination with a 
variable-power green laser (focused to a diffraction-limited spot), we 
simultaneously measure the temperature at the nanodiamond location 
using ESR spectroscopy. 

The ability to measure temperature with nanodiamonds is verified 
by heating the substrate temperature over a range of 2.5 K and simulta- 
neously monitoring the zero-field splitting (Fig. 3c, inset). To demonstrate 
nanoscale temperature control, in Fig. 3c (red points) we show the 
temperature change recorded by the nanodiamond as a function of 
the green-laser power applied to the gold nanoparticle at a distance of 
0.8 + 0.1 um. To verify further that the temperature change results 
from local heating, we repeat the measurement with the excitation 
laser displaced from the nanodiamond by 0.8 kum in the opposite direc- 
tion. In this case, the temperature measured by the nanodiamond 
remained constant as a function of laser power (Fig. 3c, blue points), 
thereby confirming the locality of the heat source. From a linear fit to 
the data, we estimate the accuracy of our nitrogen-vacancy sensor to be 
oT = 44 + 10 mK. The measured temperature change agrees with the 
theoretically expected temperature profile based on a steady-state solu- 
tion of the heat equation, AT(r)=Q / Anxr, where Q is the heat dis- 
sipation, x is the thermal conductivity of glass and r is the distance 
between the nanodiamond and the gold nanoparticle. As shown in 
Fig. 3b, by recording the temperature of six nanodiamonds at different 
distances from the laser-heated gold nanoparticle we find that the 
measured temperature profile (Fig. 3d, points) is in excellent agreement 
with the theoretical steady-state prediction (solid line). This allows us 
directly to estimate the temperature change at the location of the gold 
nanoparticle to be 72 + 6K. 

To demonstrate that nanodiamond thermometry is compatible with 
living cells, we introduce nanodiamonds and gold nanoparticles into 
human embryonic fibroblast WS1 cells using nanowire-assisted delivery’. 
Just as in the control experiments described above, we probe the tempe- 
rature at two different nitrogen—-vacancy locations (NV, and NV;) 
within a single cell while locally heating an individual gold nanoparticle 
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Figure 3 | Submicrometre thermometry using 
nanodiamonds. a, Frequency scan of a single 
nanodiamond containing approximately 500 
nitrogen-vacancy centres. The four red points 
indicate the measurement frequencies used to 
extract the temperature as detailed in Methods. 

b, Two-dimensional confocal scan of 
nanodiamonds (circles) and a gold nanoparticle 
(cross) spin-coated onto a glass coverslip. The 
colour bar represents fluorescence, expressed in 
counts per second (c.p.s.). ¢, Temperature of a 
single nanodiamond as a function of laser power 
for two different laser focus locations. The red data 
points depict the pronounced heating of a 
nanodiamond as a result of laser illumination of a 
nearby gold nanoparticle. The blue data points 
depict the same measurement with the laser focus 
displaced by 0.8 tm from the nanoparticle location; 
this results in negligible heating of the 
nanodiamond as a function of laser power. Inset, 
the measured temperature change of a 
nanodiamond, when the surrounding temperature 
is controlled by a Peltier element. d, Temperature 
changes measured (red points) at the six 
nanodiamond locations in b as a function of 
distance from the illuminated gold nanoparticle 
(cross). The blue curve is the theoretical 
temperature profile based on a steady-state 
solution of the heat equation. All data in this figure 
were obtained on a glass coverslip, and all error 
bars correspond to lo. 


(Fig. 4a). As shown in Fig. 4b, NVj, which is significantly closer to the 
heat source, has a stronger temperature dependence as a function of 
laser power than does NV>. Varying the incident power allows us to 
generate controlled temperature differences of up to 5 K over distances 
of approximately 7 um. To ensure that this temperature gradient is 
created by the controlled illumination of the nanoparticle and does 
not result from heating of the cellular cytoplasm, we displace the laser 
spot from the gold nanoparticle. This results in a negligible temper- 
ature change at the location of NV, with AT = —20 + 50 mK (Fig. 4b, 
green square). The increased measurement uncertainty for larger laser 
powers is the result of heating fluctuations from drift of the gold 
nanoparticle. 

The experiments shown in Fig. 4b demonstrate the submicrometre 
measurement of an intracellular heat gradient. However, the substan- 
tial heating induced by constant illumination for an extended period of 
time ultimately leads to the death of the cell, which is confirmed using a 
standard live/dead assay (calcein AM/ethidium homodimer-1). To 
demonstrate that our technique can be used within living cells, we 
increase the concentration of gold nanoparticles to allow for heat 
generation at different locations simply by moving the laser focus. 
Then we measure the temperature variation at a single nanodiamond 
(Fig. 4c, bar graph) while slightly heating the gold nanoparticles in two 
different locations (crosses). After our measurement, the viability of 
the cell is confirmed (Fig. 4c). 

Finally, we demonstrate that our method can be used to control cell 
viability. To start, we heat the cell with 12 1.W of laser power and mea- 
sure a temperature change of 0.5 + 0.2 K at the nanodiamond location; 
this corresponds to a change of approximately 10 K at the position of 
the gold nanoparticle. At this point, the cell is still alive, as confirmed by 
the absence of ethidium homodimer-1 fluorescence inside the mem- 
brane (Fig. 4d). By increasing the laser power to 120 LW, we induce a 
temperature change of 3.9+0.1K at the nanodiamond location 
(approximately 80 K at the location of the laser focus); in this case, 
the cell is flooded with fluorescence from the ethidium homodimer, 
signalling cell death. This proof-of-principle experiment indicates that 
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Figure 4 | Nanoscale thermometry in cells. a, Confocal scan of a single cell 
under laser excitation at a wavelength of 532 nm, with collection at wavelengths 
greater than 638 nm. The cross marks the position of the gold nanoparticle used 
for heating, and circles represent the location of the nanodiamonds (NV, and 
NV>) used for thermometry. The dotted line provides a guide to the eye and 
outlines the cell membrane. Colour bars indicate the fluorescence in counts per 
seconds. b, Measured change in temperature at the positions of NV, and NV, 
relative to the incident laser power applied to the gold nanoparticle. Dashed 
lines are linear fits to the data. Each point consists of an average of five 
measurements, with each individual measurement taking 4s. The error bars 


nanodiamond thermometry may enable the optimization of nanopar- 
ticle-based photothermal therapies’. 

Our experiments demonstrate that nitrogen-vacancy centres in dia- 
mond can be used as robust temperature sensors that combine the 
virtues of submicrometre spatial resolution, subdegree thermal sensi- 
tivity and biocompatibility. The sensitivity of our current measurement 
can be enhanced by increasing the relevant coherence time or the number 
of nitrogen—vacancy centres within the nanocrystal, or both. Optimizing 
these factors should allow us to reach sensitivities of 80 1K Hz” 
(Methods), yielding the ability to sense subkelvin temperature varia- 
tions with millisecond time resolution. In solution, the ultimate accuracy 
of our method will probably be limited by residual heating during the 
measurement process. As discussed in the Methods, this limit is in the 
range of 0.05-5 mK, depending on experimental conditions. Although 
the present work focuses on monitoring temperature variations, the 
use of diamond samples with low strain, or, alternatively, ensembles of 
nanodiamonds, should allow for the realization of an absolute thermo- 
meter (Methods). The spatial resolution of our method can be further 
improved by using far-field subdiffraction techniques”. 

The present observations raise a number of intriguing possibilities. 
For instance, the simultaneous measurement and control ofa subcellular 
thermal gradient could enable the accurate control of gene expression”. 
Further improvements in sensitivity may allow for real-time observations 
of non-equilibrium subcellular processes in biological and medical 
applications’’. The large dynamic range of our quantum thermometer 
and its intrinsic robustness may also allow for the direct microscopic 
monitoring and control of chemical reactions”. Moreover, combining 
our technique with two-photon microscopy” may enable in vivo 
identification of local tumour activity by mapping atypical thermo- 
genesis at the single-cell level*®. Finally the combination of thermoa- 
blative therapy with our temperature sensor could be a potent tool for 
the selective identification and killing of malignant cells without dama- 
ging surrounding tissue”* 
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(1o) are set by fluctuations in the laser heating of the gold nanoparticle. 

c, Fluorescence scan of stained cells (live/dead assay) with excitation at 494 or 
528 nm and emission at 515 nm (green; live cells) and 617 nm (red; dead cells). 
The bar graph shows the temperature of a single nanodiamond (circle) with 
local heat applied at two different locations (crosses). d, Confocal fluorescence 
scans of an individual cell under varying illumination power. Excitation occurs 
at 532 nm and collection is above 630 nm. Cell death is indicated by the 
penetration of ethidium homodimer-1 through the cell membrane, staining the 
nucleus. At low laser powers, the cell remains alive; cell death occurs as laser- 
induced heating is increased. 


METHODS SUMMARY 


Nanodiamond measurement pulse sequence. As indicated in Fig. 3a, we record 
the fluorescence at four different frequencies centred on 4 = 22 X 2.87 GHz: 
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This allows us to determine the change in temperature: 
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Here w+ + 6 are the four microwave carrier frequencies and 6B is an unknown 
static magnetic field. By using the fluorescence at these four frequencies as shown 
in equation (1), we are able to remove errors associated with changes in total 
fluorescence rate, ESR contrast, Rabi frequency and magnetic field. 
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Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


Experimental apparatus, sensitivity and accuracy. Our experimental apparatus 
consists of a confocal microscope with two independent excitation-collection paths, 
allowing measurement and heating at two independent locations simultaneously. 
The experiments use either a Nikon Plan Fluor X 100, oil immersion, NA = 1.3 
objective (nanodiamonds) or a Nikon Plan Apo X 100, air, NA = 0.95 objective 
(bulk sample), resulting in C ~ 0.03, which can be further improved by employing 
a solid immersion lens or diamond nanopatterning. Microwaves are delivered 
using a lithographically defined coplanar waveguide on top of a glass coverslip. 
For experiments with nanodiamonds, we use neutral-density filters in the collec- 
tion path to avoid saturation of the avalanche photodiode. The temperature accu- 
racy, oT, for bulk diamond is estimated from the measurement shown in Fig. 2b. 
Using the standard deviation o (plotted error bars), we determine the accuracy to 
be 6T =a(2tcdA/dT)~', where c is the oscillation amplitude and 2t is the free 
evolution time. We find that for integration times t < 30 s (limited by temperature 
stability) the temperature accuracy increases in proportion to v/t, giving a sensi- 
tivity of y=5T/t. A linear dependence of the dissipated heat on laser power 
(Fig. 3b) is used to determine the measurement accuracy for nanodiamonds. A 
linear function, with slope m, is fitted to the data (Fig. 3c, red dashed line) and the 


measurement accuracy is given by T = J pal ,(Ij;—m PY /(N—1), where T; is 
the measured temperature and P; is the corresponding laser power. The error bars 
are evaluated as 


: 2 I?(n/2) 
o(6T) sry N-1F(n—)/) 


where /(-) is the Gamma distribution. 

Ultimate sensitivity. The ultimate sensitivity of our method is limited by the 
nitrogen-vacancy coherence time and the number of defect centres. In our current 
experiment, we have demonstrated a sensitivity of 9 mK Hz ”? (with a free evolu- 
tion time of 250 1s). Two natural extensions allow longer nitrogen—vacancy coher- 
ences: decreasing the '°C concentration to suppress the nuclear spin bath, and 
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further dynamical decoupling. These methods can, in principle, allow us to extend 
the evolution time to T,/2 ~ 3 ms. In combination with a nanocrystal that contains 
~1,000 nitrogen-vacancy centres, this could yield an ultimate sensitivity limit of 
80K Hz ". Further improvement may be possible by using spin-squeezed 
states. Finally, the determination of absolute temperature is limited by variations 
in the zero-field splitting due to spatially varying strain. For low-strain diamond 
samples, we find that the variation in the zero-field splitting is less than 60 kHz, 
allowing for subkelvin absolute-temperature determination. In addition, the use of 
an ensemble of nitrogen-vacancy centres in different nanodiamonds with uncor- 
related strain values allows for a further increase in absolute sensitivity by a factor 
of \/n, where n is the number of nanodiamonds. 

Ultimate accuracy in solution. In cases where our method is used to probe a 
system that is in solution (for example cells and chemical reactions), the primary 
accuracy limit is set by heat dissipation during the measurement process. In par- 
ticular, the microwave spectroscopy used to detect changes in the nitrogen- 
vacancy zero-field splitting also induces heating of the solution. In the present 
experiment, we use a lithographically fabricated microwave loop (diameter, 
200 1m) to generate an a.c. magnetic field, B~ 10 mG, for spin manipulations. 
Estimating the effective dipole field created by the microwave loop shows that the 
solution (water) absorbs 10° ° W of power, yielding a temperature increase of 5 mK 
in the steady state. By using a smaller microwave loop (20 1m) and reducing the 
duty cycle, it should be possible to decrease the heating of the solution to approxi- 
mately 50 iK. 

Injection of nanodiamonds into cells. Nanodiamonds and gold nanoparticles 
were introduced into human embryonic fibroblast WS1 cells by means of silicon- 
nanowire-mediated delivery’. Silicon nanowires were treated with 3-amino- 
propyltrimethoxysilane to give NH, functionality on the surface. Nanodiamonds 
and gold nanoparticles were subsequently attached by electrostatic binding. 
Afterwards, WS1 cells were plated on the silicon nanowire substrates and cultured 
overnight. The cells were removed by trypsin treatment and re-plated on a glass 
slide with lithographically defined strip lines for ESR measurements. The samples 
were stained with calcein AM and ethidium homodimer-1 for the live/dead assay. 
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Stretchable nanoparticle conductors with 
self-organized conductive pathways 


Yoonseob Kim!, Jian Zhu', Bongjun Yeom!, Matthew Di Prima’, Xianli Su’, Jin-Gyu Kim’, Seung Jo Yoo", Ctirad Uher 


& Nicholas A. Kotov! 


Research in stretchable conductors is fuelled by diverse technologi- 
cal needs. Flexible electronics, neuroprosthetic and cardiostimulat- 
ing implants, soft robotics and other curvilinear systems require 
materials with high conductivity over a tensile strain of 100 per cent 
(refs 1-3). Furthermore, implantable devices or stretchable displays* 
need materials with conductivities a thousand times higher while 
retaining a strain of 100 per cent. However, the molecular mechan- 
isms that operate during material deformation and stiffening make 
stretchability and conductivity fundamentally difficult properties to 
combine. The macroscale stretching of solids elongates chemical 
bonds, leading to the reduced overlap and delocalization of elec- 
tronic orbitals’. This conductivity-stretchability dilemma can be 
exemplified by liquid metals, in which conduction pathways are 
retained on large deformation but weak interatomic bonds lead to 
compromised strength. The best-known stretchable conductors use 
polymer matrices containing percolated networks of high-aspect- 
ratio nanometre-scale tubes or nanowires to address this dilemma 
to some extent®"’. Further improvements have been achieved by 
using fillers (the conductive component) with increased aspect ratio, 
of all-metallic composition’’, or with specific alignment (the way 
the fillers are arranged in the matrix)’*"*. However, the synthesis 
and separation of high-aspect-ratio fillers is challenging, stiffness 
increases with the volume content of metallic filler, and anisotropy 
increases with alignment'’. Pre-strained substrates'®’’, buckled 
microwires’* and three-dimensional microfluidic polymer networks” 


1 x VAF 


a 1xLBL 


5 x LBL 


have also been explored. Here we demonstrate stretchable conduc- 
tors of polyurethane containing spherical nanoparticles deposited 
by either layer-by-layer assembly or vacuum-assisted flocculation. 
High conductivity and stretchability were observed in both compo- 
sites despite the minimal aspect ratio of the nanoparticles. These 
materials also demonstrate the electronic tunability of mechanical 
properties, which arise from the dynamic self-organization of the 
nanoparticles under stress. A modified percolation theory incorpo- 
rating the self-assembly behaviour of nanoparticles gave an excel- 
lent match with the experimental data. 

Nanoparticles may initially appear as unfavourable candidates for the 
task of filling a polymer matrix. First of all, spherical nanoparticles have a 
percolation threshold V, that is 10 to 100 times higher than high-aspect- 
ratio nanometre-scale components. The charge transport between nano- 
particles involves a large number of nanoparticle-nanoparticle junctions, 
resulting in high contact resistance and charge carriers scattering. Addi- 
tionally, nanoparticles with a strong attraction to polymers cause stiffen- 
ing of the matrix. On the other hand, nanoparticles in a polymer matrix 
represent a more dynamic system that has greater freedom for reversible 
nanoscale restructuring, which is essential for stretchability. Although 
nanoparticle dynamics in a polymer matrix at high stress/strain levels 
is poorly understood, one should expect greater matrix mobility from 
nanoparticles than from nanotubes or nanowires. Therefore, the con- 
ducting pathways lost upon deformation could potentially be recov- 
ered in a different particle configuration. It is also important that the 


Figure 1 | Preparation of polyurethane- 
nanoparticle nanocomposites. a, Photographs of 
a free-standing film (PU/NP)s9o; a consolidated 

5 X LBL stack, a free-standing film made by VAF, 
and a consolidated 5  VAF stack. b, SEM images 
of 1 X LBL and 1 X VAF. c, Cross-sectional SEM 
images of 5 X LBL and 5 X VAF. 


5 x VAF 
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conductance between two nanoparticles does not depend on their 
mutual orientation. 

As a model system, we chose citrate-stabilized gold nanoparticles 
13.0 + 0.3 nm in diameter (Supplementary Fig. la). The thin citrate layer 
on the surface of the nanoparticles makes them negatively charged. 
Citrate coating is expected to present a minimal barrier for charge trans- 
fer between nanoparticles (Supplementary Fig. 1b). Polyurethane with 
two sets of 28 repeating units of ductile moieties (Supplementary Fig. 1c) 
was used as a polymeric partner for these particles (1.0% by volume in 
aqueous solution). Its strong positive charge is complementary to the 
negative charge of the nanoparticles and allowed us to use layer-by-layer 
(LBL) deposition to make corresponding composites**”’. The applica- 
tion of LBL allowed us to (1) reach high nanoparticle loadings, (2) main- 
tain the uniformity of nanoparticle distribution throughout the material, 
and (3) make adequate comparisons between nanoparticle composites 
with different nanoscale fillers”. We also used an alternative method of 
construction, allowing us to generalize the properties of nanoparticle 
composites as stretchable conductors, in which the nanoparticle materials 
were obtained by vacuum-assisted flocculation (VAF). 

Free-standing composite films were obtained after 500 LBL deposi- 
tion cycles and the resulting polyurethane-nanoparticle film is denoted 
(PU/NP)s00. Their thickness was determined by scanning electron 
microscopy (SEM) to be 2.0+ 0.2m. The ellipsometric thickness 
for one to ten layer pairs gave a total thickness of 1.97 + 0.1 um for 
500 layer pairs (Supplementary Fig. 2a). It is nearly identical to the 
SEM thickness, which indicates excellent thickness control of the LBL 


growth**. Similar free-standing VAF sheets had an SEM thickness of 
30 + 3.0 um. Both LBL and VAF sheets had an unmistakably metallic 
appearance (Fig. la). LBL and VAF composites were made to have 
the same volumetric fraction of fillers Vs 21.7 vol.% (Supplementary 
Fig. 3). Experimentally determined values of V. for the LBL and VAF 
composites were 16.2 vol.% and 17.5 vol.%, respectively. To transition 
to fully macroscopic materials as-prepared LBL and V AF free-standing 
films were laminated into stacks”. Hot-pressing 3-10 free-standing 
sheets together at 120°C and a pressure of 20 MPa for 1h revealed a 
high degree of consolidation (Fig. 1c). The lamination temperature was 
chosen because no phase change was observed up to 130°C for all 
materials according to differential scanning calorimetry data (Sup- 
plementary Fig. 4). The laminated samples of five sheets were denoted 
5 X LBL or 5 X VAF and had SEM thicknesses of 6.5 + 0.7 um and 
110 + 10 um, respectively. 

LBL and VAF films had different properties even when the gold 
content was the same. The LBL composites had better dispersed nano- 
particles, leading to more efficient conducting pathways than did the 
VAF composites, whose nanoparticles were more aggregated. Simul- 
taneously, the presence of larger polyurethane domains resulted in a 
higher stretchability of the VAF composites (Fig. 2h-k and Supplemen- 
tary Fig. 5). The free-standing 1 X LBL and 1 X VAF had conductivities 
o of 6,800 Scm~! and 510S cm and maximum tensile strains ax Of 
16% and 75%, respectively. Lamination increased the conductivity of 
both types of film and considerably improved stretchability. The con- 
ductivities of the 5 X LBLand 5 X VAF composites were 11,000S cm * 


a © 250 105 b 408 Figure 2 | Mechanical and electrical 
©, 5, by equation (1) 4104 = = LBL properties of polyurethane- 
‘@ 200; 410° § 210 2 nanoparticle nanocomposites. 
® 410? w a a, Dependence of strain and 
@ 150+ ee 1xLBL igi & ~ 103} VAF ductivi 
B -» = 1x VAF = £ conductivity of 1 x LBL and 
s 100} 1 s 2 402 1 X VAF on the volumetric gold 
E | a 3 3 F content at e = 0%, V;. The green line 
£ SOP }493 5 7 10°; shows the calculated conductivity of 
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1 17 19 21 23 5 0 2 j4 6 8 10 power-law relation in equation (1) 
Vo (vol.%) Number of films laminated and three-dimensional percolation 
c 60 d 60 theory. b, Conductivity data of 
sal ea: —1xVAF ——3xVAF laminated films composed of 1, 3, 5 
= ——41LBL = —5xVAF —10xVAF and 10 consolidated films for LBL 
& 40r agiar s and VAF. Error bars in a and b are 
> 30 % a mean = s.d. (n = 3). ¢, d, Stress— 
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a 10 LBL a (c) and VAF (d) stacks composed of 
10 1, 3, 5 and 10 films, respectively. 
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Stretching cycles 


and 1,800Scm ! fore = 0% (Fig. 2b). Compared to the conductivities 
of other LBL films from gold nanoparticles stabilized by longer capping 
agents” , those of our composites were more than 10° times better (see 
Supplementary Fig. 1 and Supplementary Table 1 for details). The max 
values for 5 X LBL and 5 X VAF were 115% and 486%, respectively 
(Fig. 2c and d). The conductivity of 5 X LBL and the stretchability of 
5 X VAF are, to the best of our knowledge, the highest for previously 
studied composites. 

As expected, the conductivity of LBL and VAF composites decreased 
with the increase of strain ¢ (Fig. 2e). ¢ = 60% resulted in a reduction of 
conductivity to 3,500 $ cm! for 5 X LBLanda reduction to210S cm! 
for 5X VAB. ¢=110% resulted in a reduction to 2,400Scm ‘ for 
5 X LBL and a reduction to 94S cm‘ for 5 X VAF. However, the con- 
ductivity values for high strain were slightly or considerably higher than 
similar values for carbon-nanotube-based materials*’, despite the much 
smaller aspect ratio of the nanoparticles. Moreover, no other compo- 
sites have displayed ¢ = 35S cm _' at ¢ = 480%, as 5 X VAF does. 

It is thus essential to understand how these composites retain the capa- 
bility to transport electrons efficiently at very high deformations. From 
the outset we knew that nanoparticles are capable of self-assembling in 
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solution into chains”®. Atomic force microscopy, SEM and transmission 
electron microscopy (TEM) images indicated that nanoparticles in the 
LBL composites produced chains of 20-40 nanoparticles while they were 
being deposited (Fig. 2h-j). Such chains can behave similarly to high- 
aspect-ratio nanocomponents, reducing V, (ref. 27) while displaying a 
high conductance similar to that of bulk gold. This ability to self-assemble 
at the time of LBL deposition was initially hypothesized to be the cause of 
such an unusual combination of properties. We, however, could not find 
evidence for nanoparticle-chain formation in the bulk from small-angle 
X-ray scattering (SAXS) data immediately after deposition at 0% strain 
(Fig. 3g, i, and 1) or by using high-voltage electron microscopy (Sup- 
plementary Fig. 6) for the LBL and VAF composites. We next considered 
the possibility that although they may be almost fully disorganized ini- 
tially, the nanoparticles can self-organize under stress. Some elements of 
self-organization have been seen for low filler concentrations and high- 
aspect-ratio nanomaterials*’**”’. 

Indeed, we found clear evidence of stress-induced nanoparticle 
organization in both LBL and VAF composites from TEM images, 
SAXS data and SEM images (Fig. 3). First of all, TEM images of ultra- 
thin (130 nm) LBL films under tensile strain showed a high mobility of 


Figure 3 | Reorganization of 
nanoparticles under stress. 

a-f, TEM images of (PU/PAA)2(PU/ 
NP),(PU/PAA), under strains of 0%, 
10%, 20%, 30%, 40% and 50%, 
respectively, where PAA is 
polyacrylic acid. g-k, SAXS beam 
images of 5 X LBL and 1 X VAF at 
various strains. 1, SAXS diffraction 
curves for films at different strains. 
P(q) is form factor from the size and 
shape of nanoparticles. The 
diffraction curve segments around 
the peak at q = 0.045 A“! are given 
in the inset. m, Calculated 
conductivity dependence on strain 


m 
for LBL composites described by 
— percolation theory for self- 
5 assembling systems (equation (2) 
Q with various self-assembly 
= coefficients « in comparison with 
5 actual data for 5 X LBL. n-u, SEM 
3 images of focused-ion-beam milled 
8 5 X LBL and 5 X VAF at various 
402 : ‘ ; ; strains. The images in q and u were 
0 25 50 75 100 taken after five consecutive stretches 
Strain (%) to 110% and 200%, respectively. 


Milling depths for samples in this 
study are 1 jum for 5 X LBL and 2 jim 
for 5 x VAF. Stretching directions 
are indicated by the double arrows. 
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1x LBL 


Figure 4 | Viscoelastic properties of 


hae 


polyurethane-nanoparticle nanocomposites for 
different applied voltages. a, b, Damping 
parameter tand and storage modulus for 1 X LBL, 
respectively. c, d, Damping parameter tand and 
storage modulus of 1 X VAF, respectively. 
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the nanoparticles in the polymer matrix. As the strain increased, the 
nanoparticles gradually re-organized into bands along the stretching 
direction (Fig. 3a—f). In the SAXS data the well developed hourglass- 
shaped scattering patterns were observed at higher elongations of 
é€=50% for both composites, confirming that nanoparticles were 
organized in elongated structures (Fig. 3h, j, and k). The intensity of 
the diffraction peak at scattering vector q = 0.045 A also substan- 
tially increased at ¢ = 50% for both composites (Fig. 31). 

Unlike other stress-induced composite reorganization, the beam 
patterns and intensity plots of SAXS data that developed upon stretch- 
ing disappeared when the tension was released. The self-assembled 
nanoparticle patterns observed here were remarkably different from 
other patterns observed in both solutions and solids****”’. To reveal 
the nanoparticle patterns in the bulk of the composites, focused-ion- 
beam milling was applied to a depth of 1-2 um. SEM images of such 
samples remaining under ¢ = 110% for 5 X LBL and ¢ = 200% for 
5 X VAF revealed three-dimensional cellular self-organized patterns 
with characteristic dimensions of 1-5 1m (Fig. 30, p, s and t). These 
distinct nanoparticle networks can be the result of local phase separa- 
tion of the nanoparticles in the polymer under high strain. 

Importantly, when the tension was released, these cellular networks 
were not observed even after five consecutive stretches, indicating that 
the nanoparticle networks were reconfigurable (Fig. 3q and u). However, 
slight irreversible reconstruction of the material did occur on the surface, 
leading to formation of bands of nanoparticles running perpendicularly 
to the stress direction and decreased stiffness (Supplementary Fig. 7). 
With ¢ = 5% (the elastic limit of both composites is considered to be less 
than ¢ = 10%), the films were repetitively stretched to observe changes in 
conductivity with deformation cycling. Interestingly, conductivity after 
5,000 cycles increased substantially by 1.7-fold for 1 LBL and 1.5-fold 
for 1 x VAF (Fig. 2g). The resistance of 1 X LBLand 1 X VAF films after 
10,000 stretch cycles, measured parallel and perpendicular to the stretch- 
ing direction, also confirmed the gradual internal rearrangement of the 
nanoparticles, which are completely isotropic without stress. Resistance 
in both directions decreased but decreased slightly more in the parallel 
direction than in the perpendicular direction. SAXS data for 1 * LBL 
and 1 X VAF films after 10,000 stretch cycles also confirmed the reor- 
ganization of nanoparticles (Supplementary Fig. 8). 

The prepared composites presented an interesting case for percolation 
theory because of (1) the unusually high conductance of particles with 
minimal aspect ratio and (2) the extensive structural characterization 
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(Figs 1-3). We note that there are no examples of applications of per- 
colation theory to self-organized systems. The classical power law relates 
the conductivity of an unstretched material o to V;as follows: 


o=a0(Ve - vey (1) 


where oq is the conductivity of the filler, V?, and V° are the volumetric 
fractions of the filler and percolation threshold at ¢ = 0%, respectively, 
and sis the critical exponent. Unambiguous application of equation (1) to 
VAF composites is difficult owing to aggregation of nanoparticles (Fig. 2k 
and Supplementary Fig. 9). Also the existing method? is unable to 
describe percolation correctly at high strains, as is the case for VAF 
composites (see comments to Supplementary Fig. 9). Therefore, we 
restricted consideration of percolation theory to LBL composites, which 
represent an almost ideal case for this theory owing to the nearly uniform 
distribution of nanoparticles. Indeed, o,py calculated from equation (1) 
showed excellent agreement with 0); , in the absence of strain at 
various Vv; (Fig. 2a). The conductivities of the composites for different 
strains were calculated according to a recently developed method’ but 
showed significant discrepancies. As strain increases, the calculated o 
dropped faster than did the experimental data, despite the general tend- 
ency of the method to overestimate the conductance for large strains. This 
finding showed that the current percolation theory cannot explain the 
dynamic nature of conductive pathways under strain. The influence of 
stress-induced self-organization processes could be described as an 
apparent increase of the aspect ratio of the fillers, which reduced V, 
and hence increased o. Thus, V? in equation (1) was modified by intro- 
ducing a self-assembly coefficient « with square-root dependency of ¢ as 
follows: 
0 


1+ ae (2) 


When « = 0.14 the theoretical curve fitted the experimental data nearly 
perfectly (Fig. 3m). As the coefficient « increases the ability of particles to 
reorganize to form conductive pathways becomes more pronounced. The 
numerical value of « reflects the effects of both the ability of nanoparticles 
to form non-random associations in solids and their directionality. 

An understanding of charge transport in these materials is academ- 
ically and practically essential. Fitting the temperature dependence of 
conductivity from 2 to 300K with variable-range charge transport 
equations revealed clear metallic behaviour via direct nanoparticle 


Ve) = 
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contact (see Fig. 2f and Supplementary Fig. 10 for details), whereas the 
temperature dependence of conductivity from 2 to 300 K in the stretch- 
able conductors from carbon nanotubes showed semiconducting beha- 
viour and hopping electron transport’’. Importantly, gold nanoparticle 
composites had high electron mobilities of 1.75 X 10 *m* V's ' for 
1X LBL and 5.45 X10 °m?V ‘s ! for 1 X VAE, and the carrier con- 
centration of 5.83 X 10°*m~* for 1 LBL at 300K approached very 
closely to that for bulk gold, 5.90 x 107° m™* (Supplementary Fig. 11). 

As one of many possible demonstrations of the practical relevance of 
the new composites, we performed a dynamic mechanical analysis 
under small voltage to illustrate the electro-tunability of their viscoe- 
lastic properties. The relaxation of composites at a frequency of 1-10 Hz 
significantly increased when a small voltage of 0.2 V was applied, owing 
to repulsive transient charges on nanoparticles. As such, the damping 
parameter tand increased by 2.2-fold and 1.6-fold and the storage 
modulus decreased by 2.1-fold and 1.5-fold for 1X LBL and 
1 X VAF, respectively (Fig. 4 and Supplementary Fig. 13). Solid materi- 
als with electro-tunable mechanical properties are more practically 
viable than metallic structures filled with liquid electrolytes”. 

From a fundamental perspective, these composites with high gold- 
nanoparticle loading displayed a previously unknown reorganization 
into cellular networks under stress. To some degree such restructuring 
imitates the behaviour of atoms in liquid metals, while retaining struc- 
tural integrity and strong bonding. The nanoparticle dynamics in such 
composites can be described by a variation of classical percolation 
theory and can be extended to a variety of other nanoparticle-polymer 
systems. Further improvements can be achieved by the development of 
commercially affordable conducting fillers and new polyelectrolytes to 
optimize properties. Further fabrication of such composites could aim 
to meet the demands of applications in medicine, optoelectronics, and 
energy storage devices that require electro-tunable properties. 


METHODS SUMMARY 


We used as-received cationic polyurethane aqueous dispersion (30 vol.%, molecu- 
lar mass approximately 92,000, from Hepce Chem, South Korea) after diluting to 
1 vol.% solutions in deionized water. Gold(m)chloride trihydrate, sodium citrate 
tribasic dihydrate (Sigma-Aldrich) were used as-received. Citrate-stabilized gold 
nanoparticles with diameter 13 + 0.3 nm were synthesized. 

For LBL assembly, substrates were sequentially dipped into a diluted solution of 
polyurethane and a dispersion of concentrated gold nanoparticles, with each step 
followed by rinsing with deionized water and drying with compressed air. This deposi- 
tion cycle constitutes a pair of layers, and can be repeated as many times 1 as necessary 
to obtain the desirable thickness. The desired gold content in the composites was 
acquired by controlling the dipping time of a substrate to achieve the required disper- 
sion of gold nanoparticles. 

For VAF film preparation, we stirred a mixture of as-prepared gold nanopar- 
ticles and a diluted solution of polyurethane, followed by filtration. The desired 
gold content in the composites was achieved by controlling the volume of a dis- 
persion of as-prepared gold-nanoparticle solutions. Unless otherwise specified, 
LBL and VAF films had the same gold contents in all the tests. The described 
measurements were performed at room temperature and relative humidity of 
20-30% for both LBL and VAF composites. See Methods for details of the synthesis 
of gold nanoparticles, preparation of LBL and VAF films, and their characterizations. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


Synthesis of gold nanoparticles. To ultrapure deionized water (950 ml) in a one- 
litre glass beaker equipped with a magnetic stir bar was added gold(i)chloride 
trihydrate (180.0 mg, 0.458 mmol). The mixture was heated to boiling under 
vigorous stirring followed by additional heating for 20 min. Subsequently, 50 ml 
of 34mM sodium citrate solution was added to the mixture. The aqueous solution 
was heated for 20 min followed by cooling to room temperature. As-prepared 
aqueous solution was directly used for the preparation of VAF films. In some 
cases, it was further concentrated to be used as the dipping solution for LBL films. 
For the concentration process, the as-made aqueous gold nanoparticles dispersions 
were transferred into 50-ml centrifuge tubes and the concentration of the solutions 
was increased using a Sorvall Legend Mach 1.6R centrifuge (Thermoscientific, USA) 
at the speed of 10,000 r.p.m. for 1 h. Less than 5 ml of concentrated gold-nanoparticle 
solution was settled at the bottom of the tube. The supernatant (45 ml) was removed 
by suction pipettes, while taking special care to prevent disturbing the sedimented 
nanoparticles. Finally, the resultant nanoparticles were collected for the LBL process. 
Preparation of LBL films. Glass slides (25mm X 75mm, Fisher Scientific) 
cleaned by piranha solution (sulphuric acid and hydrogen peroxide, from Sigma- 
Aldrich, in a 3:1 volume ratio) overnight were dipped into a 1 vol.% solution of 
polyurethane, for 5 min, rinsed with deionized water for 1 min and gently dried with 
compressed air. Please note that the piranha solution is dangerous and extremely 
reactive with organic substances so appropriate handling precautions are essential. 
Then the slides were dipped into a dispersion of concentrated gold nanoparticles 
for several minutes, rinsed for 1 min, and again dried with compressed air. After 
achieving the desired thickness, free-standing films were isolated by etching of the 
glass slides with 1 vol.% hydrofluoric acid (Sigma-Aldrich) solution. Note that 
hydrofluoric acid solution, even diluted, is very toxic so extreme precautions must 
be taken at all times. After thorough rinsing with water, the detached films were 
dried in an oven at 60°C overnight and then kept in ambient conditions (room 
temperature and relative humidity of 20-30%) for another night before measure- 
ment of properties. 

Conditions and procedures for preparation of LBL films with various gold 
contents were the same except for the dipping time to achieve the dispersion of 
concentrated gold nanoparticles. LBL films containing 15.4 vol.%, 16.2 vol.%, 
16.9 vol.%, 19.4 vol.%, 21.7 vol.%, and 24.0 vol.% of gold nanoparticles were pre- 
pared with dipping times of 1 min, 2 min, 3 min, 8min, 14min and 20min, 
respectively. When the substrate is dipped into the gold-nanoparticle dispersion, 
tiny amounts of polyurethanes, which might be weakly bonded, are slowly diffused 
into the dispersion of gold nanoparticles. In the process of cyclic deposition, 
polyurethanes are accumulated in the dispersion of gold nanoparticles and when 
they exceed the specific threshold of concentration, gold nanoparticles are floccu- 
lated. To maintain the homogeneity of the LBL film, careful monitoring of floc- 
culation is needed. If flocculation is observed, the dispersion of gold nanoparticles 
must be changed. An LBL thin film of (PU/PAA)3(PU/NP)4(PU/PAA), was pre- 
pared for TEM experiments. Detailed methods for film preparation were exactly 
the same as the above typical process. The dipping time of the glass slide into PAA 
was 5 min. 

Preparation of VAF films. To a one-litre glass beaker containing a litre of as- 
prepared gold-nanoparticle dispersion (no additional concentration steps were 
applied) equipped with a magnetic stir bar we slowly added 1 ml of 1.0 vol.% 
aqueous polyurethane. The mixture was stirred for 15 min followed by filtration. 
Filter papers of 0.8-1m pore size with 47 mm diameter, and the filtration assembly, 
were obtained from Fisher Scientific. The resultant gold-coloured film was peeled off 
the filter paper. We recommend drying the film completely before taking it out from 
the filtration assembly. Individual peeled-off films have some fibrous material 


attached that comes from the filter paper, which can be removed by gentle scratching. 
Conditions and procedures for preparation of VAF films with various gold contents 
were the same except for the volumetric ratio of a dispersion of as-prepared gold- 
nanoparticle solutions. VAF films containing 15.9 vol.%, 16.4 vol.%, 17.3 vol.%, 
18.0 vol.%, 19.1 vol.%, 21.0 vol.%, 21.7 vol.%, 22.8 vol.% and 23.4 vol.% of gold nano- 
particles were prepared with 250 ml, 500 ml, 650 ml, 700 ml, 800 ml, 900 ml, 1,000 ml, 
1,050 ml and 1,100 ml of as-prepared gold-nanoparticle dispersion, respectively. To 
make a 30-{um-thick VAF film, in total about 1,800 ml of solution needs to be filtered. 
Characterizations. The initial stages of LBL deposition were monitored using an 
8453 ultraviolet-visual Chem Station spectrophotometer (Agilent Technologies, 
USA). The thickness of the initial assembly of the LBL film was calculated using a 
BASE-160 Spectroscopic Ellipsometer (J. A. Woollam, USA). Calculations were 
fitted using a Cauchy’s model. The LBL film for ellipsometry was prepared on 
silicon wafers following the same procedure as described for the LBL film. 

Stress-strain curves were obtained by testing ~1-mm-wide and 5-7-mm-long 
rectangular strips of samples with a mechanical strength tester 100Q (TestResources, 
USA). Tests were performed at a rate of 0.01 mms’ for LBL films and 0.08 mm s"* 
for VAF films with a ~111-N range load cell. Five samples were tested for each film. 
The force measurements from the load cell were divided by the measured initial 
thickness and initial width of the sample gauge section to give nominal stress. This 
procedure results in a plot of nominal stress versus nominal strain. The cyclic tensile 
test (Fig. 2g) was performed by a Servopneumatic Axial/Torsion Test Instrument 
with custom-made grips (EnduraTEC, USA). Tests were performed at a rate of 
0.01 mms’. 

Conductivity measurements with two-probe and four-probe methods were 
obtained using an 34401A Digital Multimeter, 6 '/2 Digit (Agilent Technologies, 
USA). Measurements of conductivity as a function of strain (Fig. 2g) were done 
using a custom four-probe setup, depicted in Supplementary Fig. 12. The mea- 
sured resistance and applied tensile strain were recorded simultaneously by a video 
camera, and close-up images of the sample were photographed by a high-speed 
camera to calculate the thickness of the sample at some strain level with the assum- 
ption that the volume of sample was kept constant. The temperature dependence 
of conductivity was measured using the four-probe method with a Magnetic 
Property Measurement System (MPMS) (Quantum Design, USA). The MPMS 
is a cryogenic probe that integrates a superconducting magnet with a supercon- 
ducting quantum interference device (SQUID) detection system and a temper- 
ature controller in a specially designed Dewar. These sophisticated configurations 
provide rapid precision measurements over a temperature range from 2 to 300 K. 

Filler fraction of sample was determined by a Pyris 1 TGA (PerkinElmer, USA) 
with a temperature ramp from 25°C to 800°C at 10°C per min under an air 
atmosphere at a flow rate of 20 ml per min. The differential scanning calorimetry 
analysis was performed using a Q200 (TA Instruments, USA). SEM images were 
obtained with an FEI Nova Nanolab dual-beam focused ion beam and scanning 
electron microscope. TEM images were obtained with a JEOL 2010F analytical 
electron microscope. High-voltage electron microscopy images were obtained 
with a high-voltage electron microscope (1,250kV, JEOL, JEM-ARM 1300S). 
Atomic-force-microscopy imaging was performed with a Nanoscope III atomic 
force microscope (Digital Instruments/Veeco Metrology Group). X-ray scattering 
data were obtained with a Bruker NanoStar Small-Angle X-ray Scattering System 
equipped with Siemens Kristalloflex 770 X-Ray Generator, and a Cu Ke radiation 
(0.1542 nm) X-ray tube with peak power of 1.5 kW on a Hi Star Area 2D detector 
with 1 min of exposure time. The damping parameter tano and the storage modu- 
lus of the samples were obtained with an RSA 3 dynamic mechanical analyser (TA 
Instruments) with applied ¢ = 0.5% within the frequency range of 1-90 Hz. A load 
cell with a maximum capacity of 3.5 N and a resolution of 1 .N was used. 


©2013 Macmillan Publishers Limited. All rights reserved 


1 sid ial Be 


doi:10.1038/nature12363 


Seasonal sea surface cooling in the equatorial Pacific 
cold tongue controlled by ocean mixing 


James N. Moum!, Alexander Perlin't, Jonathan D. Nash! & Michael J. McPhaden? 


Sea surface temperature (SST) is a critical control on the atmosphere’, 
and numerical models of atmosphere-ocean circulation emphasize 
its accurate prediction. Yet many models demonstrate large, system- 
atic biases in simulated SST in the equatorial ‘cold tongues’ (expan- 
sive regions of net heat uptake from the atmosphere) of the Atlantic” 
and Pacific’ oceans, particularly with regard to a central but little- 
understood feature of tropical oceans: a strong seasonal cycle. The 
biases may be related to the inability of models to constrain turbulent 
mixing realistically’, given that turbulent mixing, combined with 
seasonal variations in atmospheric heating, determines SST. In tem- 
perate oceans, the seasonal SST cycle is clearly related to varying solar 
heating’; in the tropics, however, SSTs vary seasonally in the absence 
of similar variations in solar inputs’. Turbulent mixing has long been 
a likely explanation, but firm, long-term observational evidence has 
been absent. Here we show the existence of a distinctive seasonal cycle 
of subsurface cooling via mixing in the equatorial Pacific cold tongue, 
using multi-year measurements of turbulence in the ocean. In boreal 
spring, SST rises by 2 kelvin when heating of the upper ocean by the 
atmosphere exceeds cooling by mixing from below. In boreal sum- 
mer, SST decreases because cooling from below exceeds heating from 
above. When the effects of lateral advection are considered, the mag- 
nitude of summer cooling via mixing (4 kelvin per month) is equi- 
valent to that required to counter the heating terms. These results 
provide quantitative assessment of how mixing varies on timescales 
longer than a few weeks, clearly showing its controlling influence on 
seasonal cooling of SST in a critical oceanic regime. 

Equatorial cold tongues in the Atlantic and Pacific are formed in part 
through diverging horizontal transport near the sea surface associated 
with large-scale wind patterns, which bring cool waters towards the ocean 
surface’*. Maintaining cool SSTs in the presence of intense solar heating 
requires a combination of subsurface mixing and vertical advection to 
transport surface heat downward*’""". Limited measurements during the 
passage of a tropical instability wave (TIW) at 0, 140° W revealed a tenfold 
increase in subsurface turbulent heat flux'’. This was sufficient to cool 
surface waters by 2 K per month, indicating that mixing alone has the 
potential to maintain the equatorial cold tongue, a conjecture supported 
by climatological evidence that sea surface cooling is enhanced by TIWs”. 

Establishing general relationships that show how irreversible tur- 
bulent mixing influences patterns in large-scale (10,000 km) ocean 
phenomena has been notoriously difficult for two reasons. First, the 
delicate measurements required to quantify mixing have been limited to 
short shipboard campaigns (30 days) that target specific processes'*"””. 
Although these have led to fundamental advances in our knowledge 
of ocean dynamics at a few locations and on timescales of a few weeks, 
such measurements are unable to determine whether changes in mix- 
ing on seasonal and interannual timescales might influence large-scale 
dynamics. Second, diagnostics of mixing in numerical model solutions 
of ocean circulation are ambiguous, because models grossly oversimp- 
lify their representation of mixing, which occurs on scales that are 
orders of magnitude smaller than the smallest spatial and shortest 
timescales resolved. 


Here we directly quantify variations in mixing on long time- 
scales using specialized instrumentation that we developed for use on 
oceanographic moorings'*’”. Since late 2005, these sensors have pro- 
vided near-continuous measurements of mixing below the Tropical 
Atmosphere—Ocean (TAO) buoy maintained by the National Ocean 
and Atmospheric Administration (NOAA) at 0, 140° W”°. 
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Figure 1 | Six-year record of mixing at the TAO mooring at 0, 140° W. 

a, Nifio3.4 SST index, a measure of the relative strengths of El Nifo and La Nifia 
events’. b, SST. c, Wind stress, t,, (7-day averages). d, Squared meridional 
velocity at 40 m filtered at 12-33 days, taken as a proxy for kinetic energy in the 
tropical instability wave frequency band (TIW KE). e, Turbulent heat flux 
averaged over depths 20-60 m, <J,>20-6om- f, Image plot of zonal currents 
(zonal velocity denoted u, eastward currents red, westward blue; see right inset 
colour scale). Coloured bars show depths and durations of y-pod deployments; 
logio Jy is indicated by the colour (10-day average, see left inset colour scale). 
Arrows at top of f show deployments (red) and recoveries (black) of moorings. 
Grey bars show period of comparison experiment with shipboard turbulence 
profiling instrumentation’*””. 
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This first multi-year time series of mixing is summarized in Fig. 1f 
together with other measures of larger-scale background influences 
(Fig. la—d). Considered together, these suggest patterns in mixing that 
span multiple depth ranges on timescales of weeks to months, contribu- 
ting to a cyclic pattern of mixing on the seasonal timescale (Fig. 1f), and 
hint at longer-timescale patterns (for example, mixing is weaker during 
El Nino events than during La Nina events or neutral periods; Fig. la, e). 
Short-timescale variations associated with shear instabilities*’”, daily 
cooling’*’ and weekly variations in surface winds’ are hidden by our 
10-day averaging. Our intent here is to establish the role of mixing on 
the seasonal cycle. 

As a metric to quantify long-term patterns, we compute the tur- 
bulent heat flux Oe coloured bars in Fig. 1f and defined in Methods) 
averaged over 20-60 m depth (<J4>20-60m; Fig. le). Over this depth 
range, turbulence fluctuations are closely related to coupled air-sea 
interactions, and to strong velocity gradients formed between the west- 
ward flowing near-surface South Equatorial Current (SEC) and the 
core of the Equatorial Undercurrent (maximum eastward flow near 
100 m; 95% confidence interval CI = 70-145 m) are marginally stable, 
offering an additional energy source for mixing. 

From monthly-averaged data, <J;>20.6om is positively correlated 
with wind stress (r = 0.43, 95% CI = 0.15-0.64, P-value P = 0.004) and 
TIW kinetic energy (KE; r = 0.59, 95% CI = 0.35-0.75, P = 0.00002). 
However, it is not significantly correlated, on a monthly average, with 
any available measure of vertical stratification, shear, or their ratio, 
Ri (the gradient Richardson number, which defines the instability 
condition for stratified shear flows). Such correlations do exist in 
high-resolution measurements from targeted process experiments*’”’, 
suggesting that the averaging over long periods hides the mechanistic 
relationships that drive the turbulence. Regardless, strong seasonal 
cycles exist in all flow properties of the equatorial cold tongue 
(Fig. 2), including clear correspondences between Tq shear and strati- 
fication. Warming of the upper 50 m in boreal spring coincides with 
reduced winds, weakening or disappearance of the westward flowing 
SEC, and reduced TIW KE; J, is diminished throughout the water 
column. Increasing winds and TIW KE occur in late summer, follow- 
ing which the SEC reappears, the layer of high shear deepens, J, 
increases and surface waters cool. Despite continued high wind stress 
(ty) in September, J, weakens throughout the upper water column, as 


does velocity shear in the upper 60m. High t, and high TIW KE 
accompany a secondary maximum in J, late in the year. 

The quantity <J,>20-6om (Fig. 3a) exhibits an annual cycle that is 
significant at the 95% confidence level. Minima in April-May are a 
factor of 6 smaller than the maximum in August. The smallest values of 
<Jq>20-60om coincide with rising or maximum SST and the largest 
values with cooling SST. 

We examine the heat budget of the surface mixed layer (with depth h) 
to illustrate the importance of J, to changes in SST. The net surface flux 
acting to heat or cool the mixed layer is J; — 0 — J" (Fig, 3a), where Ji ° is the 
total surface flux from atmosphere to ocean and I" is the intensity of 
short wave radiation penetrating below h. J° is determined from the 
TropFlux analysis®, I" is derived from an empirical formulation™, and h 
is determined from hourly temperature profile measurements on the 
TAO mooring (Supplementary Information). In the annual cycle, 
l= r pas in late boreal winter with a minimum in late boreal 
summer’. The critical seasonal modulation of J° — I" is due in equal 
parts to modulations of solar radiation and latent cooling (a seasonal 
cycle of atmospheric flux components is included in Supplementary 
Information) but is partially offset by I", which is diminished in boreal 
summer when winds and mixing tend to increase h. 

The heating and cooling rate of a fluid parcel due solely to vertical 
mixing is pC,dT/dt = —dJ,/dz, where p, C, and T are respectively the 
density, heat capacity and temperature of the fluid parcel, t is time and 
dJ,/dz the vertical heat flux divergence across the fluid parcel. Across the 

surface mixed layer dJ,/dz = oe yh Jh)Ih, where is is the turbulent 
heat flux acting across the mixed] layer base. Ifwe assume that < Ja 20. 60m 
is a reasonable estimator of J", then the sign of <J,>20-60m — ( ib —T")in 
Fig. 3a ought to reflect the sign of the observed cooling rate, “ASST/dt. 
During February to May, J; °_ yh > J", coincident with maximum sea 
surface heating. Conversely, Hon. June to September, Ja O_ phic Jes h  coin- 
ciding with maximum sea surface cooling. The maximum observed cool- 
ing rate is <1 K month ' while the heat flux divergence based on mixing 
alone contributes a maximum cooling of 4 Kmonth ' (Fig. 3b). 

The surface mixed layer heat budget discussed above considered a 
one-dimensional vertical balance. However, at the fixed point of obser- 
vation, SST is also changed by horizontal advection’: gradients in 
temperature transported by surface currents cause a local change in 
temperature (see Supplementary Information). The sign of the zonal 
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advection (uT,,) varies with that of the zonal surface currents (u) 
because the negative sign of the zonal SST gradient (T,) is relatively 
constant. Meridional advection (vT)), largely due to TIWs, is a net heat 
source (Fig. 3b). Vertical advection (upwelling) brings cool fluid nearer 
the surface but mixing is required to modify SST. The sum of heat flux 
divergence and advection balances the observed heating rate (within 
95% confidence limits) during strong sea surface cooling (July- 
September), but indicates greater heating than observed at other times. 

Vertical profiles of J’ n(@)s averaged on seasonal timescales (Fig. 4), 
show that the heat flux during June-August has significant vertical 
structure coincident with minimal ie —I", suggesting that the flux 
divergence acting across the sea surface may actually be larger than 
shown in Fig. 3b. It also emphasizes the distinctively weak turbulence 
throughout the upper 70 m in March—-May, the period coinciding with 
maximal He —y', 
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Figure 3 Annual cycles of SST and turbulence at 0, 140° W. a, SST (black), 
<Jq>20-60m (blue) and that part of the surface heat flux that contributes to 
mixed layer heating ( 1 —I") from TropFlux (red; http://www.locean- 
ipsl.upmc.fr/tropflux/). Averages are derived from the observations in Fig. 1. 
Error bars on SST and J? —I'are 95% confidence limits on bootstrap estimates 
of means from daily values. Error bars on individual values of <J,>20.6om 
represent the range based on two independent estimates of T,. One estimate is 
computed from the temperature profile the sensor measures as it profiles the 
water column by surface-wave-induced motion coupled through the surface 
float. The second estimate uses temperature profiles from auxiliary temperature 
sensors distributed about that y-pod on the mooring, averaged hourly in time 
and differenced vertically over 10-20 m. The value of J, plotted is the average of 
these two estimates. Natural variability is represented by 95% bootstrap 
confidence intervals on 15-min averages of J, for the largest (August) value by 
the adjacent blue bar, and for all of the data by the blue bar adjacent to the 
December value. Details are included in Supplementary Information. b, Terms 
contributing to SST evolution. Black line, dSST/dé; blue, 1/ (pCp)dJq/ dz; red, 
uT, green, vT,; grey, 1/(pC,)dJ,/dz + uT,,+vT,. Advection terms uT,, vT, were 
computed from surface currents measured at the mooring and a revised SST 
product”. Uncertainties (shaded regions) are 95% bootstrap confidence limits 
on means computed from 30-day averages. Confidence limits on the turbulent 
heat flux divergence are based on reasonable ranges of h (Supplementary 
Information). 
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The finding that J 0 _ Th and J" are each in quadrature with SST but 
out-of-phase with each other is a critical factor in defining the seasonal 
cycle of SST. The strong correlation between mixing and wind stress 
on the annual cycle, together with correlations found over shorter 
periods'*, suggests the importance of the (predominantly easterly) 
wind in enhancing westward currents, thereby increasing velocity gra- 
dients and promoting shear instability, which is likely to be the root 
cause of turbulence in the upper equatorial ocean*’”*”*. Higher winds 
also increase latent cooling, decreasing i —I" at the same times that 
mixing peaks. 

These are, to our knowledge, the first long-term mixing measure- 
ments in the ocean that resolve the temporal intermittency and multi- 
year variability at a point of observation. Figure 3a clearly shows that 
observed seasonal sea surface heating and cooling coincide with the 
correct sign of the air-sea heat flux divergence. Subsurface mixing is 
the dominant sea surface cooling term and controls the SST cooling 
phase, but provides too little cooling to counter surface heating from 
the atmosphere plus advection during some parts of the year. It is 
possible that, at these times, the governing heat flux divergence is 
shallower than 20m, as suggested by Fig. 4. It is also possible that 
spatial intermittency is important because the measured temperature 
of a fluid parcel is controlled by the cumulative sum of the mixing 
processes along its trajectory. Spatial gradients in temperature and 
currents are large, especially across the Equator. Gradients in mixing 
are unknown, but if they are proportionately large, then fluid parcels at 
the point of observation may have experienced a different history of 
mixing than observed. 

This finding—that equatorial mixing is organized in a meaningful 
seasonal cycle—shows that the ocean’s microstructure (on scales of 
seconds to minutes and centimetres to metres) is clearly linked to, and 
at times dominates, particular aspects of the larger-scale dynamics. 
Extended measurements like these are intended to provide quantita- 
tive estimates of mixing on interannual timescales to improve our 
understanding of the El] Nino/Southern Oscillation cycle and help to 
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Figure 4 | Seasonally averaged vertical profiles of turbulence heat flux. 
Curves are shown for December-February (DJF); March-May (MAM); June- 
August (JJA) and September-November (SON). Positive heat flux indicates 
heating from above. Shading indicates 95% bootstrapped confidence limits on 
means based on 15-min averages. At top, values of is —I' in Wm 7 are shown 
as vertical bars. 
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reduce the equatorial Pacific cold tongue bias that affects the mean 
state of many coupled ocean-atmosphere climate models’. 


METHODS SUMMARY 


The vertical (more strictly, diapycnal) flux of heat by turbulence is represented 
by Fickian diffusion, enhanced by a turbulence diffusion coefficient, K, = 77/ T2 
(ref. 27). The temperature variance dissipation rate (7) is computed by scaling the 
temperature gradient spectrum measured by fast thermistors on small autonom- 
ous instruments (‘y-pods’)'* attached to oceanographic moorings and packaged 
with inertial navigation units to quantify the component of flow speed past the 
sensor that is due to wave-induced motions of the surface float'*. The other 
component of the flow speed is the ocean current speed, measured separately by 
velocity sensors on the mooring. The vertical temperature gradient, T_, is defined 
locally by the vertical motion of the sensor through the water or, on a larger scale, 
by additional temperature sensors on the mooring. The depth-dependent vertical 
heat flux caused by turbulence is J, = —pC,K,T-. 
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Feeding andesitic eruptions with a high-speed 
connection from the mantle 


Philipp Ruprecht! & Terry Plank! 


Convergent margin volcanism is ultimately fed by magmas gener- 
ated in the mantle, but the connection between the mantle and the 
eruption at the surface is typically obscured by cooling, crystalliza- 
tion and magma mixing within the crust’*. Geophysical techni- 
ques are also not very effective in the lower and middle crust, where 
seismic events are rare and resolution is generally poor*”. It has 
thus been unclear how fast mantle-derived magmas transit the crust 
and recharge crustal magma chambers. Here we use diffusion mod- 
elling of nickel zonation profiles in primitive olivines from diverse 
primary melts® ’° to show how mantle recharge may occur on time- 
scales as short as eruptions themselves. In Irazu volcano in Costa 
Rica, magmas apparently ascend from their source region in the 
mantle through crust about 35 kilometres thick in just months to 
years, recharging hybrid basaltic andesites over the course of the 
eruption. These results show that large stratovolcanoes with shal- 
low magma chambers’ may still preserve the deep record of their 
mantle origin in olivine crystals. This approach—documenting 
magma ascent timescales from the mantle beneath a convergent mar- 
gin stratovolcano—can be applied to other eruptions that record 
magma mixing with recharge melts. Signs of volcanic unrest are 
typically monitored at the surface or upper crust; new efforts should 
look deeper, tracking magma movement from the base of the crust 
to the surface in the months to years before eruptions. 

The 1963-65 eruption of Irazu volcano, the largest stratovolcano in 
the Central American arc'’, produced basaltic andesites that contain a 
remarkable diversity of olivine crystals, many of which still preserve 
chemical vestiges of their mantle origin. Nickel (Ni) and forsterite (Fo) 
variations (measured by laser-ablation inductively coupled plasma 
mass spectrometry, LA-ICPMS; see Methods) in olivines from the sin- 
gle 1963-65 eruption record as much variation as global and regional 
data sets*”'®'* (Fig. 1). In particular, primitive magnesian olivines 
(>Fogg) have Ni concentrations that vary by a factor of about two. 
Such Ni variations at high, constant Fo contents are thought to be 
created during mantle melting—resulting from melting of distinct 
mantle lithologies (for example, pyroxenitic versus peridotitic mantle) 
and melting at distinct depths® ’°. The large range in Ni concentrations 
at constant Fo content implies that distinct batches of mantle melts are 
supplied to Irazu volcano—even within a single eruption. The actual 
process that creates these variations is not as critical to this study as the 
timescale over which those variations would be erased. Upon crystal- 
lization, the Ni concentration in melt is controlled predominantly by 
growth of olivine alone. Other silicate phases (plagioclase and pyrox- 
enes) later join the crystallization sequence and drive a shallower Ni- 
Fo trajectory (Fig. 1). 

Such an evolutionary path of the magma is supported by other trace- 
element variations in olivine (see Supplementary Fig. 5 for Ca, Sc, Ti 
variations with Ni content). High Ni concentrations at intermediate Fo 
content reflect magma hybridization at shallow crustal levels during 
mixing of primitive mantle-derived magmas with evolved resident 
magmas and assimilation of foreign crystals (that is, xenocrysts; see 
Supplementary Information). Although shallow magma processes (such 
as magma mixing or prolonged residence in hybrid magmas) are 


recorded in the majority of the olivine crystals, we focus on the mag- 
nesian olivines with variable Ni content because they provide insight 
into mantle recharge timescales. 

We consider the magnesian olivines that are common in hybrid 
magmas of Irazu volcano’’ to be magmatic in origin and true pheno- 
crysts (that is, crystals growing from the ascending primitive mantle 
melt that later mixes with resident magma in the shallow plumbing 
system). Their phenocrystic character is indicated by their texture (well- 
formed crystals with sharp, continuous crystal faces typical of growth 
from a melt), and trace-element compositions (see Supplementary 
Information). Even if they are mantle xenocrysts, however, they still 
constrain the timescale of ascent from the mantle. 

Most olivines from the 1963-65 Irazu eruption record growth in a 
closed system magma, with melt evolving along a liquid line of descent 
(Fig. 2a, b). Some olivines (~15% of all analysed) show a more com- 
plex history of mixing between distinct primitive melts, with Ni zona- 
tion crossing multiple olivine fractionation curves (Fig. 2c, d). Retention 
of such distinct Ni zonation requires that mixing of these mantle melts 
must have been faster than the timescales of diffusive re-equilibration. 
Given the high Fo contents of the olivines that record this Ni zonation 
(many over 90% forsterite, Foo9, which is typical of mantle olivine), 
this mixing process requires deep crystallization in or near the mantle, 
or not far removed in time from the mantle history. Indeed, thermal 
arguments show how any crustal residence before mixing must be very 
short (less than a month) to both preserve the Ni zonation and prevent 
a decrease in Fo (see Supplementary Information). 

Experimental studies constrain Ni diffusion in olivine’®* and ena- 
ble us to calculate mixing-to-eruption timescales for these mantle melts 
(see Methods Summary). Some individual olivine crystals are rever- 
sely zoned in Ni, with reversals up to 1,000 p.p.m. at only slightly vary- 
ing Fo (Fig. 3). In some cases the reversals correlate with chromium 
(Cr) and phosphorus (P) reversal, which put additional constraints on 
the crystallization and diffusion history of these olivines. Although 
diffusion rates of trivalent Cr in olivine are currently debated, P dif- 
fusion is known to be slow compared to Ni and it therefore records 
growth zoning’*. Zonation bands that are narrower in P than Ni 
(Fig. 3d, f) indicate some diffusive re-equilibration of Ni, and this 
zonation can be used to constrain the mixing timescales of the prim- 
itive melts before eruption. Cr does not always correlate with Ni because 
(1) Ni possesses greater primary variation and (2) Cr in olivine may be 
affected by coprecipitation of spinels”. 

Mixing-to-eruption timescales derived from one-dimensional Ni 
zoning profiles are short (months to a few years; see Fig. 4, Supplemen- 
tary Table 1 and Methods for details of the calculation procedure). Cal- 
culated timescales are maximum estimates, whereas a Monte Carlo 
approach is used to generate probability density functions (PDFs) and 
estimate uncertainties. Maximum timescales of mixing before erup- 
tion are derived for 11 olivines with Ni reversals unambiguously related 
to mixing of two distinct mantle melts. Using a Ni diffusion coefficient 
(Dy) at 1,200 °C (appropriate for the primitive melts that host these 
magnesian olivines) leads to average timescales of 0.6*§" years and 
0.3733 years for the earlier (IZ-10-11) and the later (IZ-10-13) 
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Figure 1 | Compositional diversity in olivine phenocrysts. a, Ni and 
forsterite contents measured by LA-ICPMS for 176 core-to-rim traverses (6,398 
data points) in olivine phenocrysts from the 1963-65 Irazu eruption. Green and 
red fields are olivines calculated to be in equilibrium with pyroxenite and 
peridotite source compositions taken from ref. 10 as one possible source of 
primary melt variability. Fractionation curves (solid and dashed curves) for 
different primary melt compositions encompass the compositional diversity 
recorded in the most primitive olivines (>Fogg). Tick marks are 1% increments 
of olivine crystallization. External reproducibility is estimated by analysis of 
olivine grains of San Carlos USNM 111312/44 (SC-ol). The major-element 
fractionation path passes through melt inclusion compositions from ref. 26 and 
additional melt inclusions reported here. The high- and low-Ni fractionation 
curves assume 302 p.p.m. Ni at 8.08 wt% MgO and 247 p.p.m. at 9.53 wt% MgO 
in the melt, respectively. Each contour varies by an added 16.7% Ni in the melt 


erupted samples (see sample description for details of the stratigraphy). 
Given that calculated timescales are short compared to the time of 
volcanic activity (approximately 2 years), mantle-derived magmas 
were actively mixing deep in the plumbing system while the eruption 
was already in full swing and magma transit through the crust may 
have sped up as the eruption progressed—potentially as continuous 
magma extraction from mantle and ascent led to the establishment of 
a more transmissible crustal pathway over time. This raises the ques- 
tion of where the magmas mix and whether we can use such mixing 
records to extract transit times from the mantle to the surface of this 
large, active convergent margin stratovolcano. 

The non-primitive character of many arc magmas, phase equilibria 
constraints combined with thermal considerations’, as well as the geo- 
barometry of gabbroic cumulates and volcanic mineral assemblages”, 
all suggest that mantle-derived melts may crystallize over a wide pres- 
sure range (typically 4-10 kbar) in the crust. In the case of Central America 
this pressure range represents the entire crustal column, assuming an 
average crustal density of 2,830kgm * and that the Mohorovici¢ 
discontinuity (Moho) is 35 km deep’’. Furthermore, the fact that the 
primitive Irazu olivines are >Fogg suggests that they are the Mg-rich 
liquidus phase and therefore probably represent the earliest silicate 
crystallization, close to the mantle-crust interface, because the melts 
would evolve to low Mg/Fe ratios upon ascent, transient stalling and 


Forsterite content (mol.%) 


compared to the low-Ni liquid line of decent. At about 1,000 p.p.m. Ni the data 
diverge from the simple fractionation curves and follow a shallower trajectory 
associated with the shallow co-crystallization of plagioclase, pyroxene and 
olivine (see Supplementary Data for additional trace elements supporting this 
interpretation, and Supplementary Fig. 5). The vertical dotted line is the olivine 
composition in equilibrium with whole rocks"! from the 1963-65 eruption 
assuming a partition coefficient Kp of 0.3. b, Irazu data (yellow shading) 
compared to the typical compositional variations observed in arc magmas 
represented here by data from Mexico”"® (MX, hatched, dotted outline) and 
Daisen Volcano in Japan" (JP, purple). c, Irazu data (yellow shading) compared 
to global compilations® for peridotites (brown), mid-ocean-ridge-basalt 
olivines (MORB, orange), Hawaiian olivines (HI, hatched, dotted outline), and 
other ocean-island-basalt olivines (OIB, blue). 


cooling (see Supplementary Discussion). Thermodynamic” and pseudo- 
liquidus” phase calculations confirm olivine as a liquidus phase for 
pressures over 9 kbar, potentially co-crystallizing with orthopyroxene. 
However, the absence of high-magnesium-number (Mg#) orthopyr- 
oxene suggests that olivine is the primary liquidus phase throughout 
the crustal column. 

Assuming magnesian olivine crystallization and mixing of primary 
mantle melts near the Moho, at 9 kbar depth (about 30 km deep), and 
given Ni diffusion timescales of the order of one year (Fig. 4), we cal- 
culate average crustal transport rates of 80 m per day for mantle mag- 
mas. Even if the upper 5-10 km are transited rapidly as part of the 
eruption process, lower and middle crustal transport rates would still 
be of the order of tens of meters per day (that is, 55 m per day, assum- 
ing 10 kmat >0.1 ms°_',as for explosive eruptions”). It is also possible 
that mantle melts mix in the crust, but the high Fo contents that vary 
little across Ni zonation profiles (for example, >Fog9 in Fig. 3c) place 
tight constraints on the timescales involved. For example, as little as 
35 °C cooling will drive olivine to drop one Fo unit”, and if this occurred 
before mixing, it would be observed. If melts ascend adiabatically, they 
may nearly parallel the olivine liquidus”, and thus could transit the 
crust chemically unmodified until they crystallize at the point of water- 
vapour saturation at 5-8km depth (for the 1723 Irazu eruption”*), 
where there is a clearly imaged low-velocity zone that probably represents 
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Figure 2 | Nickel and forsterite zoning in olivine from Irazu volcano. a, Ni 
versus forsterite variations for olivine phenocrysts that follow a single liquid 
line of descent. Colour coding in the crystal traverse ties the Ni-Fo data spatially 
to the image of the crystal. b, Conceptual evolution of olivines evolving along 
subparallel olivine fractionation trends. The two red curves show that crystals 
often diverge slightly from the calculated olivine-fractionation trend, 
suggesting minor melt mixing. The grey curves are the same fractionation 
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Figure 3 | Representative zonation profiles for three magnesian olivine 
phenocrysts. a, b, Sample IZ1011-ol4; c, d, Sample 1Z1013-o0122; e, f, Sample 
1Z1013-o0153. All olivines show Ni reversals that are modelled (blue line) by 
equation (1) and schematically displayed in Supplementary Fig. 8. All 11 
olivines are shown in Supplementary Figs 6 and 7. The mixing-to-eruption 
timescales represent the mode of the probability density functions (PDFs) 
derived via a Monte Carlo approach where the lower and upper bound are 
calculated for 1,200 °C and 1,100 °C, respectively. Full uncertainties are given in 
Supplementary Table 1. We note that the displayed timescales for the best-fit 
diffusion profiles (blue line, lowest root-mean-square value among all Monte 
Carlo calculations) are typically in agreement with the estimates from 
maximum-probability estimates (Fig. 4). Only olivines with less well-defined 
PDF peaks show a shift to longer timescales in the best-fit estimate. In some 
cases the Ni reversal coincides with Cr and P zonation, whereas P zonation 
typically shows a sharper gradient consistent with very limited diffusive 
reequilibration’’. Gaps in the Cr data are caused by partial ablation of Cr- 
spinels. The distance of crystal traverses is measured from the rim towards the 
centre of each grain. 
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curves for different primary melt compositions with tick marks at 1% 
crystallization increments as in Fig. 1. ¢, Ni versus forsterite variations for 
olivine phenocrysts with strongly varying Ni concentrations at primitive Fo 
content, suggesting the involvement of distinct mantle-derived melts. 

d, Conceptual evolution of Ni in an olivine, recording two distinct primitive 
mantle melt compositions (i and ii). 


a major magma storage zone beneath Irazu'?””. If this were the case, 
however, melts would ascend at even faster rates than the crustally 
averaged rates calculated above (55-80 m per day), because adiabatic 
ascent would require transport rates of over 1,000 m per day in a 2-m- 
wide, 25-km-long dike (see Supplementary Information). Thus, no 
matter at what depth the Ni clock is set, processes happen quickly 
before eruption. Mixing of mantle melts, ascent through the crust, 
hybridization in the upper crustal magma storage region, and final 
ascent in the conduit must all happen within months to years, within 
the two-year span of this eruption itself. 

Recent studies have reported earthquake swarms in the lower crust 
that are ascribed to magma movement****”’. High-resolution depth 
location of these events in the western USA and Iceland reveals sig- 
nificant upward migration between swarms, of the order ofa kilometre 
per week or per month, very similar to the rates estimated above for 
the 1963-65 Irazu eruption (that is, 50-100 m per day, or 1 km per 10- 
20 days). The comparison is not perfect, because most of these seismi- 
cally active systems did not erupt, whereas Irazu did, and Irazu olivines 
record an integrated ascent over the entire crustal column that could 
include very rapid transit punctuated by brief stalling on the way to the 
surface. Nonetheless, the crystal records are consistent with the seismic 
observations and provide a new constraint for magma transport from 
the mantle to the surface underneath large stratovolcanoes. 

Our results indicate that (1) distinct primary melts coexist and mix 
in the mantle beneath arc volcanoes. (2) These mantle-derived melts 
may stall only briefly in the crust with total crustal ascent timescales of 
the order of months to years. Thus, crustal processes” in some regions 
may be limited and our results establish (3) a direct connection between 
the mantle and shallow crustal magma chambers. Even though com- 
positionally distinct magmas may mix and hybridize completely’”, (4) 
olivines may retain chemical vestiges of their mantle history. A fairly 
homogenous basaltic andesite eruption like Irazu 1963-65 belies the 
rich diversity of olivines it contains, which record the magma’s inte- 
grated history from the mantle to eruption. This is not an isolated 
occurrence; magma mixing, mafic magma recharge, and high Fo olivi- 
nes are common to many stratovolcanoes above subduction zones*"’, 
and the approach we have outlined here may be applied generally. It 
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Figure 4 | Mantle melt mixing-to-eruption timescales for all analysed 
olivines. a, Relative PDFs for calculated mixing-to-eruption timescales at 
1,100 °C and 1,200 °C (note the different time axes for the two end-member 
temperatures, which vary by a factor of ~3.5). See Methods for additional 
information. Colours correspond to the different olivines in b. b, Compilation 
of the calculated timescales ordered by stratigraphic order (Supplementary 
Fig. 1) showing the 2-s confidence interval (coloured bars). Mode (diamonds) 
and mean (vertical bar) of the PDFs are shown. 1Z1013-0144 plots beyond the 
scale of the plot for both temperature calculations; the mode for the 1,100 °C 
and 1,200 °C calculations is 17.3 and 61.3 years, respectively (Supplementary 
Table 1). The five profiles shown for each crystal of each sample have been used 
to calculate average timescales (Avgi 109 and Avgi299) for the rapid mixing-to- 
eruption timescales observed in the Irazu data set. Olivines from the later 
erupted sample IZ-10-13 record on average shorter timescales, suggesting a 
speeding up of magma ascent as pathways in the crust became more open 
during the progress of magma extraction and eruption at the surface. 

c, Conceptual diagram illustrating melt mixing at Moho depth followed by fast 
ascent (over a months-to-year period) to the surface. At shallow levels the mafic 
magmas mix with resident evolved magma in the shallow crust. See text for 
additional information. Green arrows represent distinct mantle melts pooling 
at the base of the crust. Once extracted from the mantle the primitive mixed 
magma (red) ascends through the crust until it interacts with a more evolved 
hybrid magma (orange) and erupts as such hybrid magma. 


remains to be seen how many other stratovolcanoes like Irazu may be 
fed and possibly triggered by magma recharge directly from the man- 
tle. Seismically recorded deformation in lower and mid-crustal levels* 
may be the first precursor of such fast-ascending mantle-derived melts. 


METHODS SUMMARY 


The zoning of about 176 olivines from three tephra samples was studied using a 
combination of back-scattered electron microscopy and LA-ICPMS. Major- and 
minor-element compositions were analysed by electron microscopy (wavelength 
dispersive X-ray spectroscopy). Continuous zoning profiles of major (Fe-Mg) and 
trace elements were determined by LA-ICPMS. To account for diffusive aniso- 
tropy in olivine, lattice orientations of analysed olivine were characterized using 
electron back-scatter diffraction. Because the erupted magmas are hybrid in com- 
position, we estimated temperatures of the primitive end-members using a com- 
bination of thermodynamic models and olivine-melt thermometry. Ni diffusion 
calculations were based on an analytical solution for a confined-planar interior 
source with constant boundary conditions’ and uncertainty estimates for the time- 
scales of mixing to eruption were generated using Monte Carlo simulations (Fig. 4). 
Initial and boundary conditions as well as parameters varied for the Monte Carlo 
simulations are summarized in Supplementary Fig. 8. Ni diffusivity was taken from 
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refs 16, 17 and 33. Presented timescales for mixing to eruption are conservative 
(maximum) estimates due to the use of a one-dimensional model that neglects 
volume diffusive re-equilibration, the assumption that zoning profiles are purely 
diffusive (no simultaneous growth or spatial smearing by the laser ablation ana- 
lyses), and the low end of the temperature range (1,100-1,200 °C) chosen. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


Electron microprobe analyses. Olivine mounts were imaged via back-scattered 
electrons with a Cameca $X100 electron microprobe at the American Museum of 
Natural History, New York. Quantitative crystal traverses and olivine-hosted melt 
inclusions were obtained by electron microprobe at the American Museum of 
Natural History using multiple spot analyses, performed at 15kV accelerating 
voltage, 15-20nA probe current, and beam size of 1-5 um. All elements (Si, Ti, 
Na, Mg, Fe, Mn, Cr, Al, P, K, Ca, Ni) were routinely measured for 20-30 s. Long 
counting times were only used when improved precision was necessary (that is, 
electron microprobe versus LA-ICPMS data comparison; Supplementary Fig. 4), 
in which counting times generally were extended to 60 s (Ti, Cr, Ni) and 120s (Al, 
Ca). To minimize Na loss we measured Na using sub-counting (ten steps), while 
recording the zero-time intercept. Reference materials (olivine 174.1 (ref. 34) and 
augite 164905 (ref. 35) were reproduced to better than +2% relative error (1c) for 
major elements (Mg# to better than +0.5%). Extended counting times for the 
comparison of electron microprobe and LA-ICPMS resulted in reproducible Ni 
(Ol174.1) and Cr (Aug16490) concentrations to within +5% and +3%, respect- 
ively. Olivine-hosted melt inclusions were analysed with a 10-j1m beam at 15-kV 
accelerating voltage with a composite measurement routine where Na was mea- 
sured first at 4nA for 20s with a zero-time intercept. Other major and minor 
elements (Al, Mg, Si, K, P, Ti, Ca, Fe, Mn) were measured next at 10nA with 
counting times varying between 20 s and 30s on peak. Cl, S, and Cr were measured 
last at 40 nA with counting times of 60 s, 80s and 30s, respectively. Most elements 
of our reference material (Mangatolu Rift-ND7001, n = 3) were reproducible to 
better than +4% (Si: 0.3%, Ti: 5.3%, Al: 0.3%, Cr: 7.9%, Fe: 1.4%, Mn: 22%, Mg: 
2.0%, Ca: 0.8%, Na: 8.3%, K: 2.2%, P: 55%, S: 3.2%, Cl: 5.6%). 

Laser-ablation inductively coupled mass spectrometry. Major- and trace-element 
traverses via LA-ICPMS were collected using a VG PQ ExCell and NewWave 196-nm 
ArF Eximer laser at Lamont-Doherty Earth Observatory. Olivine traverses were 
ablated as core-to-rim line scans using a 10-25-1m laser spot rastering across the 
crystal at ~3pums | with frequency of 15Hz and a power output of 70% 
(>10.5Jcm 7 and >2.0GWcm ”). The ablated material was transported in a 
He-Ar atmosphere to the ICPMS. Each traverse was pre-ablated with a 25-50-11m 
laser spot at 20% power output. Dwell times for major and minor elements in each 
cycle were 10 ms. Reference sample calibrations were performed using reference 
materials BHVO-1, BCR-1, and BIR-1. MgO served as internal reference element 
and data quality was monitored by measuring San Carlos olivine (USNM 111312/ 
44), Elemental concentrations were corrected by normalizing data either to publi- 
shed concentrations for elements with accepted values in San Carlos olivine'***°? 
or to an average calculated from the San Carlos measurements in our study that are 
within 1 s.d. of the average of all our San Carlos olivine runs (measurement sessions 
with large variations were omitted because there was a problem during the cross- 
calibration between pulse-counting and analogue mode that led to large uncertain- 
ties for some elements such as Ca, Sc and Mn; Supplementary Table 3). External 
reproducibility can be estimated from the scatter of individual San Carlos olivine 
traverses (Fig. 1). LA-ICPMS time series were processed with a Matlab code 
(developed by P.R.) that follows standard calibration techniques and calculates 
major- and trace-element concentrations for each individual time slice. To reduce 
the noise of the measured profiles, while providing detailed information on indi- 
vidual zones within each olivine, each reported datum is an average of five time 
slices (each slice 0.36s and ~1.1 1m, and so each datum is 1.8s and 5.4m). 
Reproducible results were obtained for a large suite of elements (Li, Mg, P, Ca, 
Sc, Ti, V, Cr, Mn, Fe, Co, Ni, Cu and Zn). 

Electron-backscattered diffraction. Electron-backscattered diffraction measure- 
ments to determine olivine crystal orientations were performed on a Zeiss Supra 
40VP scanning electron microscope, Oxford energy-dispersive X-ray spectroscopy 
and HLK electron back-scatter diffraction (EBSD) integrated system at the Marine 
Biological Laboratory in Woods Hole. Diffraction patterns were matched against 
a reference olivine lattice model and Euler angles were recorded for a rotation from 
the reference frame to the observed orientation. 

Ni diffusion modelling. We use the one-dimensional analytical solution for a con- 
fined source in an infinite medium with fixed boundary conditions” to estimate 
timescales since mixing, assuming isothermal and isobaric diffusion and therefore 
a constant diffusion coefficient D for Ni in olivine: 


C(x) = Couter + 0.5(Cinner = Couter)(erf] (L = x)/(2(Dt)?*)] 
+ erfl(L — x)/(2(Dt)°*)]) (1) 


where Couter 20d Cinner are the initial Ni concentration profiles after instantaneous 
growth, separating a high outer Ni zone from a low inner Ni core at the location L 
of the concentration step away from the rim. Ni diffusion coefficients are calculated 
for constant temperature and pressure and a specific crystal lattice orientation, 


LETTER 


where «, / and y represent angles between profile direction and the crystallogra- 
phic axes [100], [010] and [001], respectively'®!’. The crystal lattice orientation 
was determined by EBSD (see above). Diffusion coefficients were calculated for a 
conservative temperature range (1,100-1,200 °C) consistent with pMELTS” using 
olivine-liquidus temperatures of back-calculated melt inclusion compositions (with 
2-4 wt% HO; Supplementary Table 2) and with other geothermometry estimates 
for Irazii magmas’’. These temperatures are slightly higher than that (1,043- 
1,074 °C) previously reported for 1723 Irazt melt inclusions”*’, which are trapped 
in uniformly more iron-rich olivine compositions (<Fog7) than the olivines model- 
led here. Calculated at intermediate pressures (500 MPa and oxygen fugacity of 
3.2 X 10 '° Pa) the Ni diffusion coefficients range from 1.34-8.06 X 10° '*m?s"! 
(at 1,100 °C) to 4.75-28.5 X 107 !8§m?s~! (at 1,200 °C)!*!”53, Our timescale calcu- 
lations assume that zonation profiles are the result of diffusive re-equilibration only, 
occurring after instantaneous growth of a Ni-rich zone. Following this initial 
growth episode, we assume no additional growth occurred (until shallow fractional 
crystallization led to low-Fo rims) and Ni profile evolution was purely diffusive 
(that is, the chemical Pe number is small (<1) where Pe = VI/D, with V, 1 and D 
being the growth rate, the half-width of the Ni reversal, and the Ni diffusion 
coefficient, respectively"°). However, growth and diffusive re-equilibration may 
be synchronous where growth is more gradual, modifying the compositional gra- 
dients according to the moving boundary. For those large-Pe-number conditions, 
true diffusive timescales would be shorter because the half-width is widened by 
growth. Thus, our calculated timescales are maximum estimates. Comparison of 
existing Ni diffusion data for olivine'®'* suggests that Ni diffusion is known to 
within a factor of 2. We have chosen to calculate the Ni diffusion coefficient D from 
the studies'®'” that (1) give maximum timescales and (2) provide complete for- 
mulations for temperature and pressure dependence. 

Error analysis for timescale calculations. Ni zonation profiles are subject to vari- 
ous complications, which we try to exclude whenever possible to minimize the 
uncertainties of our timescale calculations. Thus, we limit our timescale analysis to 
olivines that pass a series of data quality tests. We exclude crystals from our calcu- 
lations (1) that lack lattice orientation measurements (via EBSD), (2) for which we 
have significant contamination from ablated mineral inclusions (that is, high Cr 
from Cr-spinel, correlating with entire high Ni reversal), (3) that have a Ni reversal 
only in more iron-rich olivines and extend beyond the ‘olivine fractionation 
corridor’ to lower Fo (<Fogg), and (4) that show only minor Ni variations (ANi 
<300p.p.m.). Furthermore, we limit our analysis to olivines with Ni reversals 
because Ni variations due to changes in melt compositions are readily spotted 
in those crystals. Such Ni variations are less apparent in normally zoned olivine 
phenocrysts. We avoided cases where olivines show correlated Ni reversals with Fo 
reversal and the resulting Ni-Fo variations are parallel to olivine fractionation 
curves for Irazu (Fig. 1). In these crystals, distinguishing diffusion from growth 
effects is less certain and mixing along a liquid line of descent may only reflect 
thermal mixing in a thermally zoned magma body*'”’. Given these restrictions, we 
find only 11 useful traverses (Fig. 4, Supplementary Figs 6 and 7; a total of 176 
olivine traverses were measured) that show primary variations in trace elements at 
high Fo content and can be used for estimating magma transfer timescales. This 
emphasizes that many crystals (>100) per sample have to be analysed to extract 
not only the mantle signature, but also the mixing relationships of such signatures, 
as most crystals do not preserve this early history and instead are overprinted (or 
crystallized later) by the shallow processes typical for arc andesites (such as at Irazt). 
A recent numerical study of crystal gathering and dispersal*’ shows how different 
crystals may record very different stages in a single mixing event, with some 
recording strong variations if they are near large chemical gradients, while others 
barely experience any significant compositional variation. Thus, continuous magma 
evolution and mixing of different batches quickly obscures the record of primitive 
mantle magma compositions. 

Besides the process-related uncertainties (that is, growth versus diffusion versus 
fractional crystallization), more quantifiable uncertainties exist regarding the length 
of the laser profile and the orientation of the laser traverse with respect to the oli- 
vine lattice orientation. The laser ablation stage speed was set at 3 tm s_'. Comparison 
of the resulting laser track length with length measurements from backscattered- 
electron microprobe images indicates that the true stage speed is in good agree- 
ment (within 5%) of the measured value. A 5% underestimation of the stage speed 
results in 5% longer diffusion length scales. EBSD measurements are subject to 
uncertainty owing to misalignment of the reference lattice with the measured dif- 
fraction pattern. Repeat measurements suggest that crystal orientation determi- 
nations are generally robust (better than +10°). 

PDFs are derived via a Monte Carlo approach. Best-fit mixing-to-eruption time- 
scales are calculated using equation (1) through minimization of the root mean 
square between model and measured concentration profiles (Fig. 3 and 4) while 
varying Couters Cinnen L, % B and y between acceptable bounds (Supplementary 
Table 4). Best-fit timescales for all trials are combined into PDFs for each olivine, 
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providing an estimate of the uncertainties of the calculated timescales. We vary (1) 
the three angles (as mentioned above) by +10”, (2) the post- and pre-mixing equi- 
librium Ni concentration by +100 p.p.m. around an average value of the observed 
step, and (3) the location of the initial step by +15 tm. A full summary of the diffe- 
rent parameters used for the Monte Carlo approach is given in Supplementary 
Fig. 8 and listed in Supplementary Table 4. PDFs for each olivine crystal provide 
an estimate for fixed pressure-temperature conditions of diffusive re-equilibration 
(that is, 500 MPa and 1,100 ° C); higher or lower assumed temperatures shift the 
results systematically to shorter or longer mixing timescales, respectively (Fig. 4). 
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The molecular logic for planarian regeneration along 
the anterior-posterior axis 


Yoshihiko Umesono>7+, Junichi Tasaki?, Yui Nishimura’, Martina Hrouda’, Eri Kawaguchi’, Shigenobu Yazawa’, 
Osamu Nishimura’, Kazutaka Hosoda’, Takeshi Inoue* & Kiyokazu Agata? 


The planarian Dugesia japonica can regenerate a complete 
individual from a head, trunk or tail fragment via activation of 
somatic pluripotent stem cells’*. About a century ago, Thomas 
Hunt Morgan attempted to explain the extraordinary regenerative 
ability of planarians by positing two opposing morphogenetic gra- 
dients of formative “head stuff” and “tail stuff’ along the anterior- 
posterior axis**. However, Morgan’s hypothesis remains open to 
debate. Here we show that extracellular signal-related kinase (ERK) 
and Wnt/B-catenin signalling pathways establish a solid frame- 
work for planarian regeneration. Our data suggest that ERK sig- 
nalling forms a spatial gradient in the anterior region during 
regeneration. The fibroblast growth factor receptor-like gene 
nou-darake’ (which serves as an output of ERK signalling in the 
differentiating head) and posteriorly biased B-catenin activity®* 
negatively regulate ERK signalling along the anterior—posterior 
axis in distinct manners, and thereby posteriorize regenerating 
tissues outside the head region to reconstruct a complete head- 
to-tail axis. On the basis of this knowledge about D. japonica, we 
proposed that B-catenin signalling is responsible for the lack of 
head-regenerative ability of tail fragments in the planarian 
Phagocata kawakatsui, and our confirmation thereof supports 
the notion that posterior B-catenin signalling negatively modulates 
the ERK signalling involved in anteriorization across planarian 
species. These findings suggest that ERK signalling has a pivotal 
role in triggering globally dynamic differentiation of stem cells ina 
head-to-tail sequence through a default program that promotes 
head tissue specification in the absence of posteriorizing signals. 
Thus, we have confirmed the broad outline of Morgan’s 
hypothesis, and refined it on the basis of our proposed default 
property of planarian stem cells. 

The planarian Dugesia japonica has four distinct body regions 
arranged in an anterior to posterior sequence: head containing a brain 
and eyes (H), prepharyngeal (Pr), pharyngeal containing a pharynx 
(Ph), and tail (T) regions*”-”” (Fig. 1a) and possesses high regenerative 
ability’’*. After amputation, anterior-facing wounds form a head 
blastema, whereas posterior-facing wounds form a tail blastema, a 
phenomenon known as regeneration polarity. Thereafter, the com- 
plete anterior—posterior pattern regenerates within a week by filling 
in the prepharyngeal and pharyngeal regions between the head and 
tail’*'*. This sequence of events requires appropriate activation of 
somatic pluripotent stem cells (neoblasts), the sole source of regen- 
erative cells’. 

Recent RNA interference’”'* (RNAi) studies revealed that Wnt/B- 
catenin signalling regulates regeneration polarity in planarians**”. 
RNAi of planarian /-catenin caused transformation of the tail region 
into a head-like region®””’. B-catenin presumably forms a decreasing 
tail-to-head activity gradient as a result of the expression of posterior 
Wnt genes’, as inferred from the expression pattern of DjAbd-Ba, a 
target Hox gene of posterior Wnt/B-catenin signalling in neoblast 


progeny'”” (Fig. 1b), and accordingly fits the instructive role of the 


“tail stuff’. By contrast, no molecules have been reported to function as 
the “head stuff”. 
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Figure 1 | B-catenin signalling affects ERK signalling during regeneration. 
a, Anterior—posterior regional subdivisions. b, Signalling pathways. 

c-i, DjmkpA expression at 3 days of regeneration (Re-3d). c, Control trunk 
fragment (30/30). d, Djf’-catB(RNAi) trunk fragment. e, Control head 
fragment. Strong DjmkpA expression occurred transiently in the tail blastema”, 
and then decreased at this time (26/26). White asterisk indicates DimkpA 
expression in cells associated with pharynx regeneration (left). Most animals 
(13/17) showed no obvious pERK signals around posterior-facing wounds 
(bracket, right). f, Djf-catB RNAi increased DjmkpA expression (left) and 
pERK level in the presumptive tail region (bracket, right). g-i, Regenerating 
head in trunk fragments. Black asterisks (a, c, d and g-i) indicate a pharynx. 


Anterior is to the left. Scale bars: c, 500 um; e, 200 um. 
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We previously reported that mitogen-activated protein kinase 
(MAPK)/extracellular signal-related kinase (ERK) signalling is highly 
activated at wounds and induces DjmkpA, a D. japonica MAPK 
phosphatase gene, in head and tail blastema cells (Fig. 1c); activated 
ERK (phosphorylated ERK; pERK) and DjmkpA form a negative 
feedback circuit that controls blastema cell differentiation (Fig. 1b), 
resulting in regeneration of the head or tail ata wound”. Thus, hitherto 
unknown links between ERK and posterior B-catenin signals might 
direct the appropriate differentiation of neoblasts to reconstruct a 
complete head-to-tail axis, and here we tested this possibility during 
regeneration of D. japonica. 

To sensitively visualize ERK activation in vivo, we monitored 
DjmkpA expression. RNAi of D. japonica B-cateninB’ (Djf-catB) 
clearly increased DjmkpA expression in the presumptive regenerating 
tail via ERK activation (Fig. 1c-f). Dj$-catB RNAi also increased 
DjmkpA expression in the regenerating head region via ERK activation 
(Fig. 1c, d) because this increase was strongly suppressed by treatment 
with MEK inhibitor U0126 (Fig. 1g, h), which inhibits ERK signalling” 
(Fig. 1b and Supplementary Fig. 1). Conversely, increased B-catB 
activity induced by RNAi of D. japonica adenomatous polyposis coli 
(DjAPC) strongly decreased DjmkpA expression in regenerating head 
(Fig. 1i) and interfered with head regeneration (Supplementary Fig. 2). 
These observations suggest that the posterior B-catenin signalling 
level may directly affect the ERK signalling level, and the absence of 
B-catenin signalling allows neoblasts to achieve the ERK activation 
necessary for head regeneration at anterior-facing wounds. 

We next examined whether ERK signalling is also required for 
regeneration of the prepharyngeal and pharyngeal regions. U0126 
(25 uM) caused failure of head and tail regeneration (Fig. 2a, b and 
Supplementary Fig. 3) without affecting neoblast survival, as assayed 
by the expression of DjpiwiA, a neoblast-specific marker gene’ (Sup- 
plementary Figs 1b and 4c). Such inhibitor treatment also prevented 
pharynx regeneration from both head and tail fragments (Fig. 2a, b and 
Supplementary Fig. 3), and blocked reconstruction of the single 
anterior gut branch from pre-existing gut tissues in regenerating tail 
fragments (Fig. 2a, b). Further analysis of regenerating tail fragments 
revealed that lower concentrations of U0126 (5-6.25 1M) severely 
inhibited formation of both an anterior gut within the prepharyngeal 
region and a pharynx, but not head regeneration (Fig. 2c), suggesting 
that quantitative differences of ERK activity influence reconstruction 
of the anterior—posterior pattern. Quantitative reverse transcription- 
polymerase chain reaction (qRT-PCR) analysis revealed a decreasing 
head-to-tail gradient of DimkpA expression in response to ERK activa- 
tion, a pattern reversed compared with that of DjAbd-Ba expression 
(Fig. 2d). Indeed, a lower concentration of U0126 inhibited DjmkpA 
expression most strongly outside the head region (Fig. 2e, f), in the 
same region where severe regeneration defects occurred (Fig. 2c). 
Thus, graded ERK activation along the anterior—posterior axis may 
instruct the reconstruction of the prepharyngeal and pharyngeal 
regions between the head and tail. 

D. japonica nou-darake (Djndk) encodes a fibroblast growth factor 
(FGF) receptor-like transmembrane protein, but lacks the cytoplasmic 
kinase domain characteristic of the FGF receptor family*. Our previous 
finding that Djndk RNAi induced ectopic differentiation of head 
tissues, such as eyes and brain cells, outside the head region’, a 
phenotype opposite to that of ERK inhibition, led us to test the 
relationship between ERK and Djndk for specifying the intermediate 
subdivisions along the anterior—posterior axis during regeneration. 

We demonstrated that de novo expression of Djndk serves as an 
output of the ERK/DjmkpA feedback circuit in the differentiating head 
blastema (Supplementary Fig. 4). In control tail fragments at 3 days of 
regeneration, DjmkpA was highly expressed with a well-demarcated 
posterior end in the head region, and expressed at progressively lower 
levels towards the pharyngeal region (Fig. 3a). Dindk RNAi abolished 
the clearly defined boundary of DjmkpA expression between the head 
and prepharyngeal region (Fig. 3b), presumably due to increased 
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Figure 2 | Effects of pharmacological ERK inhibition on anterior-posterior 
patterning during regeneration. a, Control tail fragment at Re-7d (15/15). 
b, 25 uM U0126 treatment blocked regeneration of the H, Pr and Ph regions at 
Re-7d, but allowed the regenerant to retain the two original posterior branches 
of the gut (10/12), as assayed by anti-AADC antibody staining (green), and 
caused failure to regenerate a pharynx (Hoechst 33342 staining (magenta) (12/ 
12)). ¢, 6.25 UM U0126 treatment caused the regenerant to have two gut 
branches in the presumptive Pr region (5/17; bracket, left), but left head 
regeneration normal at Re-7d (100% of animals, n = 17). This treatment also 
caused failure to regenerate a pharynx (13/17, right). The brain in the H region 
and the pharynx (asterisk in a) are visualized by Hoechst 33342 staining 
(magenta). d, Relative gene expression levels at Re-3d, as determined by qRT- 
PCR. Regenerants were divided into quarters along the anterior—posterior (AP) 
axis and total RNA was isolated from each quarter. Bars show mean = s.d. of 
replicate (RT-PCR assays (n = 3) of pooled RNA from each quarter of 12 
animals (Student’s t-test, *P < 0.05, **P < 0.01). e, f, DijmkpA expression in 
control versus 6.25 UM U0126-treated regenerants at Re-3d. Arrowheads 
indicate DjmkpA expression in cells associated with pharynx regeneration. 
Anterior is to the left. 


DjmkpA expression in the prepharyngeal region. RT-PCR confirmed 
that Djndk RNAi increased the ERK-dependent DjmkpA expression 
(Fig. 3c). Later, Dijndk RNAi caused formation of ectopic head tissues 
predominantly in the prepharyngeal region*"* (Supplementary Fig. 5b) 
and marked loss of prepharyngeal identity’'® (Fig. 3d-f). These 
observations suggest that Djndk RNAi caused transformation of the 
prepharyngeal region into a head-like region through ERK activation 
in anon-cell autonomous manner. Moreover, Djndk RNAi rescued the 
loss-of-pharynx phenotype in a moderately ERK-deficient background 
(Fig. 3c, g-i). Thus, we speculate that Djndk might encode a protein 
that concentrates in the head region and binds to molecules secreted in 
that area (as yet unidentified) that promote ERK activation in neoblasts 
involved in anteriorization. 

We also showed that posterior f-catenin signalling acts to specify 
the pharyngeal, but not prepharyngeal, region during head regenera- 
tion from tail fragments. Djf’-catB RNAi decreased the posteriorly 
biased DjAbd-Ba expression (Fig. 2d), and conversely increased the 
expression of DjmkpA and DjsfrpA (a readout of ERK signalling in 
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Figure 3 | Effects of Dindk RNAi on anterior-posterior patterning during 
regeneration. a, b, DjmkpA expression at Re-3d. Arrowheads indicate DimkpA 
expression in cells associated with pharynx regeneration. c, Relative gene 
expression levels at Re-3d. Bars show mean = s.d. of replicate RT-PCR assays 
(n = 3) of pooled RNA from 15 animals (Student’s t-test, *P < 0.05, 

**P <.01).d, e, Ventral view of DjPN8 expression at Re-7d. Brackets indicate 


the head blastema’’), suggesting that anterior ERK signalling was 
increased in Djfs-catB(RNAi) animals (Fig. 4a-c; Supplementary Fig. 6). 
However, these knockdown animals failed to form a pharynx rudiment 
at 3 days of regeneration (Fig. 4d, e), as assayed by the expression of 
DjFoxA (a marker for pharyngeal cells'®). Combined 51M U0126 
treatment and Djf-catB RNAi additively decreased DjFoxA expression 


U0126 (5 1M) + Dindk(RNAi) 


(16/17) 


DjPN8 expression in the prepharyngeal region that was severely affected by 
Djndk RNAi. Djndk RNAi caused formation of an ectopic eye posterior to an 
original eye. f, The number of DjPN8* cells in the boxed area (n = 3 animals in 
each assay; Student’s t-test, **P < 0.01). Error bars represent s.d. g—i, Staining 
with DjFoxA probe (green) and Hoechst 33342 nuclear staining (magenta) at 
Re-7d. Asterisks (d, e, g and i) indicate a pharynx. Anterior is to the left. 


(Fig. 4c), suggesting that interplay between ERK and posterior 
B-catenin signals is required to regenerate pharyngeal cells. At 7 days 
of regeneration, Djf’-catB(RNAi) animals showed roughly normal 
regeneration of the head and prepharyngeal regions (100% of animals, 
n = 30) and ectopic fusion of the two posterior branches of the gut in 
the presumptive pharyngeal region (Fig. 4f), a phenotype opposite to 
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Figure 4 | Phenotypes induced by RNAi of f-catenin in different planarian 
species. a, b, DjmkpA expression at Re-3d. Arrowheads indicate DjmkpA 
expression in cells associated with pharynx regeneration. c, Relative gene 
expression levels at Re-3d. Bars show mean = s.d. of replicate RT-PCR assays 
(n = 3) of pooled RNA from 15 animals (Student’s t-test, *P < 0.05, 

**P <.01).d, e, DjFoxA expression at Re-3d. Asterisk (d) indicates a pharynx 
rudiment. f, Staining with anti-AADC antibody (green) and Hoechst 33342 
nuclear staining (magenta) at Re-7d. The Djfs-catB(RNAi) animal completely 
lacks a pharynx, and instead shows ectopic fusion of the two posterior gut 
branches (bracket). g, Molecular logic for planarian regeneration along the 
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anterior—posterior (A-P) axis. h, i, Left panels, control tail fragment of 
Phagocata kawakatsui at 19 days after amputation and a Pkf-catB(RNAi) tail 
fragment at 16 days after amputation. Right panels, enlarged view of 
corresponding boxed areas in left panels. The Pkf-catB(RNAi) animal 
regenerated a head-like structure, including a brain visualized by strong 
staining with Hoechst 33342 (blue) and with anti-synaptotagmin (SYT) 
antibody (green), and a paired cluster of photoreceptor cells visualized by anti- 
arrestin staining (magenta; arrowheads). The control animal failed to 
regenerate a head and showed high background signals of anti-SYT staining. 
Anterior is to the left. 
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that induced by ERK inhibition (Fig. 2c), at the expense of the pharynx 
structure. This suggests that Djs-catB RNAi caused transformation of 
the pharyngeal region into a prepharyngeal-like region through ERK 
activation. 

Altogether, our findings encourage us to refine Morgan’s hypothesis 
at the cellular and molecular levels as follows. We showed that Djndk 
RNAi anteriorized the prepharyngeal region, and Djfs-catB RNAi 
anteriorized the pharyngeal and tail regions, with increases of ERK 
activity level in both cases, suggesting that regenerative cells are prone 
to undergo differentiation into head tissues in the absence of these two 
posteriorizing signals (Fig. 4g and Supplementary Fig. 5). Accordingly, 
we propose that neoblasts might follow a default program of differ- 
entiation into head tissues that is triggered by the activation of ERK 
and calls to mind the old description by Morgan in 1904 that “in the 
absence of polarity (i.e. posteriorizing signals) the new material (i.e. de 
novo pERK) produces a head” in Planaria maculata’. Thus, activated- 
ERK triggering of the default program might correspond to Morgan’s 
“head stuff’. To validate this, we will have to demonstrate the existence 
of a pERK gradient itself in planarians. 

The ability to regenerate a head varies among planarian species”. In 
the planarian Phagocata kawakatsui, the tail region shows markedly 
reduced head-regenerative ability, whereas tail regeneration is normal 
(Fig. 4h and Supplementary Fig. 7). We reasoned that if head tissue 
differentiation is a default property of neoblasts in response to ERK 
activation not only in D. japonica but across planarian species 
(Supplementary Fig. 8), excessive posterior f-catenin signalling might 
be interfering with the ERK activation necessary for head regeneration 
from tail fragments of P. kawakatsui. Indeed, RNAi of P. kawakatsui 
f-cateninB (Pkp-catB) enabled de novo regeneration of a head-like 
structure, including brain tissues and a pair of eyes, at anterior-facing 
wounds of tail fragments (9/19; Fig. 4i). Moreover, seven of these nine 
regenerants showed negative phototaxis” (a typical outcome of the 
function of the brain and eyes”), in contrast to regenerants from 
control tail fragments (0/13; Supplementary Fig. 9). Thus, we 
identified a genetic cause of the lack of head-regenerative capacity in 
this planarian species. We speculate that the balance between anterior 
ERK signalling and posterior B-catenin signalling may vary among 
planarian species, resulting in the drastic differences of their head- 
regenerative capacity. 

In summary, we have not only integrated the concept of default 
differentiation of stem cells into head tissues into Morgan’s hypothesis, 
but have also demonstrated that posterior Wnt/B-catenin signalling 
interferes with head tissue differentiation in planarians by overriding 
this default. 


METHODS SUMMARY 


Planarian culture, drug treatments and quantitative RT-PCR”, behavioural 
assay”, gene silencing by RNA interference'”'* and whole-mount in situ 
hybridization were performed as reported. The Pkf-catB gene sequence was 
obtained from transcriptome analysis of P. kawakatsui by next generation 
sequencing with a Roche 454 GS FLX Titanium. Based on the sequence data, a 
1,300-base-pair complementary DNA fragment of Pkf-catB was amplified from a 
cDNA library of non-regenerating intact planarians constructed in lambda ZAP II 
(Stratagene) using oligo(dT) primers, and used as a template to make double- 
stranded RNA. Whole-mount immunostaining is described in the Methods 
section. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


Animals. A clonal strain of the planarian Dugesia japonica derived from the Iruma 
River in Gifu prefecture, Japan, which is maintained in autoclaved tap water at 
22-24 °C, was used in this study. An isolated colony of the planarian Phagocata 
kawakatsui derived from the Yogo Lake area in Shiga prefecture, Japan, which is 
maintained in autoclaved tap water at 16-18 °C, was used in this study. These 
planarians (about 8 mm in length) were starved for at least 1 week before experi- 
ments. They were transversely dissected into three pieces: head, trunk (including 
pharynx) and tail, and allowed to undergo regeneration. 

Treatment with a chemical inhibitor. The MAPK/ERK kinase (MEK) inhibitor 
U0126 (Cell Signaling Technology) was dissolved in dimethyl sulphoxide (DMSO) 
and used at the indicated concentration(s) for each experiment. Amputated 
planarians were allowed to regenerate in tap water supplemented with the 
inhibitor or DMSO alone immediately after amputation for the indicated period 
of regeneration for each experiment. 

Identification of P. kawakatsui f-cateninB (Pkf-catB) gene. Transcriptome 
analysis of P. kawakatsui was performed by next generation sequencing with 
Roche 454 GS FLX Titanium. For this purpose, messenger RNA was isolated from 
190 trunk fragments containing a pharynx and from 178 tail fragments. The 
PkB-catB gene sequence was obtained from the Roche 454 sequencing reads by 
using the full-length amino acid sequence encoded by the D. japonica [-catB gene 
as a query through tblastn search. A 1,300-bp cDNA fragment of Pkf-catB was 
amplified from a cDNA library of non-regenerating intact planarians constructed 
in lambda ZAP II (Stratagene) using oligo(dT) primers and used in this study. 
RNA interference. Double-stranded RNA (dsRNA) was synthesized essentially as 
previously described’”'*. Sequences used to generate dsRNAs are shown in 
Supplementary Fig. 10. dsRNA was suspended in diethylpyrocarbonate 
(DEPC)-treated HO. Intact planarians were injected with dsRNA every other 
day for a total of 2 times using a Drummond Scientific Nanoject II injector 
(Drummond Scientific Company). Control animals were injected with DEPC- 
treated HO alone. In the case of P. kawakatsui, no injection was performed in 
control animals. One day after the last injection, planarians were amputated and 
allowed to undergo regeneration. 

Whole-mount in situ hybridization. Animals were treated with 2% hydrochloric 
acid (HCI) in 5/8 Holtfreter’s solution for 5 min at 4 °C and fixed in 5/8 Holtfreter’s 
solution containing 4% paraformaldehyde and 5% methanol for less than 2 h at 4 °C. 
Hybridization and colour detection of digoxigenin (DIG)-labelled RNA probes were 
carried out as previously described”*. A mixture of BCIP/NBT (5-bromo-4-chloro-3- 
indolyl-phosphate/nitro blue tetrazolium) (Roche), HNPP/Fast Red (Roche) or 
TSA Labelling (Molecular Probes) was used for colour development of the alkaline 
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phosphatase-conjugated anti-DIG antibody (Roche) or for detection of 
fluorescence. 

Whole-mount immunostaining. Planarians were treated with 2% HCl in 5/8 
Holtfreter’s solution for 5 min at 4°C and washed twice with 5/8 Holtfreter’s 
solution at room temperature. They were then fixed with 5/8 Holtfreter’s solution 
containing 4% paraformaldehyde and 5% methanol for less than 2h at 4°C. The 
fix solution supplemented with PhosSTOP phosphatase inhibitor (Roche) was 
used for the detection of phosphorylated protein. To lower the background, 
fixed planarians were treated with hybridization buffer for whole-mount in situ 
hybridization overnight at 55 °C. Fixed animals were blocked with 10% Blocking 
reagent (Roche) in PBST (phosphate buffered saline containing 0.1% Triton 
X-100) for 1h at 4°C, and then incubated with 1/1,000 diluted mouse 
anti-AADC”, 1/1,000 diluted mouse anti-DjPiwiA”’, 1/200 diluted rabbit anti- 
phosphorylated histone H3 (Upstate Biotechnology), 1/1,000 diluted rabbit anti- 
phosphorylated ERK (pERK)”*, 1/1,000 diluted mouse anti-synaptotagmin™ or 
1/1,000 diluted rabbit anti-arrestin'* antibody overnight at 4 °C. The samples were 
washed with PBST for 30 min four times and signals were detected with 1/500 
diluted Alexa Fluor 488- or 594-conjugated goat anti-mouse or anti-rabbit IgG 
(Invitrogen) in 10% goat serum in PBST overnight at 4°C in the dark. pERK 
signals were detected with 1/500 diluted alkaline phosphatase-conjugated goat 
anti-rabbit IgG and HNPP/Fast Red (Roche). The samples were also incubated 
with 1/1,000 diluted Hoechst 33342 at the same time. 

Reverse transcription and quantitative RT-PCR analysis. Total RNA was iso- 
lated from 15 whole tail fragments of D. japonica using Isogen LS (Nippon Gene). 
Twelve regenerating tail fragments of D. japonica were divided into quarters along 
the anterior—posterior axis and total RNA was also isolated from each quarter. In 
the case of P. kawakatsui, the stock of mRNA isolated for transcriptome analysis 
was used. First-strand cDNA was synthesized using a QuantiTect Reverse 
Transcription Kit (Qiagen). Quantitative analysis of the amount of each gene 
product was carried out as previously described” using an iCycler real-time 
PCR machine (Bio-Rad Laboratories). Measurements were normalized by the 
expression level of a constitutively transcribed housekeeping gene, GAPDH. The 
mean of three replicate qRT-PCR assays was reported. Oligonucleotide primer 
sequences used for the assays are listed in Supplementary Fig. 11. 

Behavioural assay. Ten millilitres of planarian culture water (23 °C) was put in a 
3-cm wide X 6-cm long X 1-cm deep quartz rectangular container painted black 
except for one clear end. A planarian was put into the container 1.5 cm from the 
clear end, and the container was exposed to 300 lx of white light from a horizontal 
position at the clear end of the container. Planarian behaviour was recorded using 
an overhead digital video camera (Sony) for 90s. 
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Restoration of anterior regeneration in a planarian 
with limited regenerative ability 


James M. Sikes'} & Phillip A. Newmark! 


Variability of regenerative potential among animals has long 
perplexed biologists’. On the basis of their exceptional regenerative 
abilities, planarians have become important models for understand- 
ing the molecular basis of regeneration”. However, planarian species 
with limited regenerative abilities are also found**. Despite the 
importance of understanding the differences between closely related, 
regenerating and non-regenerating organisms, few studies have 
focused on the evolutionary loss of regeneration*, and the molecular 
mechanisms leading to such regenerative loss remain obscure. Here 
we examine Procotyla fluviatilis, a planarian with restricted ability to 
replace missing tissues’, using next-generation sequencing to define 
the gene expression programs active in regeneration-permissive 
and regeneration-deficient tissues. We found that Wnt signalling 
is aberrantly activated in regeneration-deficient tissues. Notably, 
downregulation of canonical Wnt signalling in regeneration-deficient 
regions restores regenerative abilities: blastemas form and new heads 
regenerate in tissues that normally never regenerate. This work 
reveals that manipulating a single signalling pathway can reverse 
the evolutionary loss of regenerative potential. 

Unlike more commonly studied planarians’, representatives of the 
family Dendrocoelidae regenerate heads only when amputated within 
the anterior third of the body, yet retain full posterior regeneration 
ability along the anterior—posterior axis (Fig. 1a, b and Supplementary 
Fig. 1)°*. After amputation in regeneration-deficient tissues, P. fluviatilis 
fails to produce a blastema and never regenerates anterior structures 
(Fig. 1b and Supplementary Fig. 1). The stage at which regenerative 
processes fail in this animal is unknown. Planarian regeneration encom- 
passes several processes, including wound healing to cover exposed tissues 
and allow signalling between the wound epidermis and underlying 
mesenchymal cells’, apoptosis®, and stem cell (neoblast) proliferation’. 
Following amputation, neoblast proliferation occurs in two waves: a 
systemic response shortly after amputation and a later burst near the 
wound site’. In addition, regenerating tissues re-establish proper axial 
polarity, using a number of conserved signalling pathways'®"*. Sub- 
sequently, tissue outgrowth occurs as an undifferentiated mass of cells 
(the regeneration blastema) differentiates to replace lost structures’». 

To identify the nature of regeneration failure in P. fluviatilis, we 
characterized several of these early regenerative processes following 
amputation in both regeneration-proficient (Reg) and regeneration- 
deficient (Reg ) tissues. Histological staining and electron microscopy 
revealed that wound healing occurs properly following amputation in 
all tissues, regardless of regenerative potential (Fig. 1c, d). After ampu- 
tation in Reg” tissues, biphasic cell division occurs in both Reg and 
Reg tissues (Fig. le, fand Supplementary Fig. 2). In addition, gut tissues 
seem to remodel'® in fragments that fail to regenerate (Supplementary 
Fig. 1). These data show that early phases of the regenerative response 
occur, although Reg’ tissues fail to form a blastema. 

To assess whether axial polarity is re-established properly following 
amputation in Reg tissues of P. fluviatilis, we characterized the spatio- 
temporal expression of homologues of nou-darake (ndk) and sFRP1, 
genes expressed specifically at the anterior of other planarian 


species’?"*'”, Both Pfu-ndk and Pfu-sFRP1 are expressed at the 
anterior wound site shortly after amputation in Reg” tissue (Fig. 1g, 
i). However, the expression of these genes was reduced in Reg tissue 
following injury (Fig. 1h, j), indicating that the initial failure of regen- 
eration occurs at or upstream of axial re-polarization. 

Because these polarity markers are not expressed appropriately 
following amputation in Reg” fragments, we sought to identify gene 
expression differences between Reg” and Reg’ tissue after amputation. 
We generated a de novo P. fluviatilis transcriptome and used RNA 
sequencing (RNA-seq) to characterize transcripts from excised tissue 
fragments in Reg’ and Reg” body regions 24h post amputation 
(Fig. 2a). We performed parallel analyses on tissues excised from intact 
animals at identical body regions to account for regional differences in 
transcripts, thereby identifying changes resulting from amputation 
(Fig. 2a). Analysis of amputated versus intact tissues revealed that 
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Figure 1 | Procotyla fluviatilis fails to regenerate heads after amputation in 
posterior body regions. a, b, Proficient and failed head regeneration 10 days 
after amputation in Reg* and Reg’ regions, respectively (n = 25/25). Dashed 
lines indicate amputation planes. c, d, Complete wound epithelium (arrows; 
n = 8/9) and full wound closure (n = 7/8) 48 h after amputation in Reg’ tissue. 
Scale bar in d, 100 um. e, f, Mitotic activity 4 days after amputation in Reg* and 
Reg’ tissues (n = 10/treatment) assayed by phosphohistone H3 (H3S10ph) 
immunostaining. g-j, Pfu-ndk and Pfu-sFRP1 expression 24h after amputation 
in Reg” and Reg’ tissues (n = 8/treatment). Anterior is to the left. Scale bars, 
250 tum unless otherwise noted. 
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Figure 2 | Comparative transcriptomics reveal differentially expressed 
genes following amputation in Reg* and Reg’ tissues. a, Experimental 
strategy to identify transcripts responsive to amputation in Reg* (red) and Reg™ 
(blue) tissues. b, Alteration in expression levels of Wnt signalling components 
after amputation in Reg* (red) and Reg” (blue) tissues measured by RNA-seq. 


10.7% of the assembled contigs (16,026/149,594) were significantly 
altered = twofold (P < 0.05) after amputation in either Reg’ or Reg” 
tissues. After collapsing contigs likely to be representing the same 
transcript based upon BLAST comparison similarity, we focused our 
analysis on 15,742 contigs that seem to be expressed differentially after 
amputation (based on the large number of contigs, many individual 
transcripts are likely to be represented by multiple contigs). Whereas a 
small number of contigs were simultaneously over- or under-represented 
in both conditions (74/15,742), many were over- or under-represented 
exclusively in either Reg’ or Reg” fragments (14,288/15,742). Other 
contigs were over-represented in Reg” tissue and under-represented in 
Reg’ tissue (537/15,742) or vice versa (842/15,742) (Supplementary 
Table 1). Upon close examination of transcripts over-represented in 
Reg” tissues and under-represented in Reg* tissues, we found that 
several represented genes were involved in Wnt signalling. Given the 
importance of Wnt signalling in defining anteroposterior polarity in 
other planarian species’®'’'*, we focused on genes involved in this 
pathway. 

RNA-seq revealed significant over-representation of many tran- 
scripts encoding Wnt ligands and receptors in Reg tissues after ampu- 
tation (Fig. 2b), with some transcripts, such as Pfu-wnt11-1, upregulated 
as much as ~400-fold relative to intact controls. These same trans- 
cripts were downregulated in Reg” tissues relative to their position- 
adjusted intact controls (Fig. 2b). In addition, homologues of Wnt 
inhibitors, such as sFRP1 and sFRP2 (ref. 18), were downregulated in 
Reg” tissues and upregulated in Reg” tissues (Fig. 2b). These patterns 
of gene expression were confirmed by quantitative PCR with reverse 
transcription (qRT-PCR) (Fig. 2c, d). Pfu-B-catenin1, the intracellular 
effector of Wnt signalling, was not expressed differentially following 
amputation in either tissue region (Fig. 2b), indicating that Pfu-p- 
catenin1 expression is not responsive to wounding in either Reg* or 
Reg tissues. 

Because axial repolarization fails in Reg tissues after amputation, we 
propose that upregulation of posteriorly expressed genes, including 
Wnt ligands, may inhibit signals that lead to proper anterior—posterior 
patterning and, thus, block regeneration. To test this hypothesis, we 
disrupted Wnt signalling using RNA interference (RNAi) to target 
Pfu-B-catenin1, the intracellular Wnt signalling effector. Remarkably, 
Pfu-f-catenin1 RNAi resulted in blastema formation and regeneration 
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c, d, Changes in transcript levels based on relative quantification (RQ) of 
selected anterior- and posterior-specific Wnt signalling components after 
amputation in Reg* (red) and Reg’ (blue) tissues relative to uncut controls 
measured by qRT-PCR. Error bars represent standard deviations. 


of a complete head and brain in Reg” fragments as assayed by regen- 
eration of the photoreceptors (n = 64/71) (Fig. 3b, f), whereas control 
RNAi animals, injected with double-stranded RNA from a ccdB and 
camR-containing bacterial sequence, failed to form a blastema or 
neural structures (Fig. 3a, e). Pfu-f-catenin1 RNAi also affected pos- 
terior regeneration with the formation of heads at posterior facing 
wounds (Supplementary Fig. 3), which has been observed in other 
planarian species'®. Knocking down Wnts individually or in combina- 
tion did not rescue Reg tissue (Supplementary Table 2). Pfu-f-catenin1 
(RNAi) animals with rescued regeneration demonstrated anteriorly 
directed movements within 15 days after amputation (Supplementary 
Video 1), indicating complete and functional regeneration of the head. 
Rescue of regeneration in Pfu-f-catenin1(RNAi) animals reveals that 
Reg tissues are competent to express the head regeneration program, 
but either lack signals required for re-establishment of axial polarity or 
these signals are inhibited. 

In contrast, knockdown of APC, an inhibitor of S-catenin and Wnt 
signalling, also resulted in blastema formation, but led to regeneration 
of tails at anterior-facing wounds (n = 14/53, Fig. 3c, g), as observed 
in Schmidtea mediterranea’. These data indicate that altering gene 
expression to create either an anterior or posterior polarity cue within 
Reg’ tissues can perpetuate downstream steps in regeneration, thereby 
allowing blastema formation and regeneration of heads or tails at 
anterior-facing wounds. Notum has recently been identified as a 
Wnt inhibitor expressed at anterior facing wounds in the planarian 
S. mediterranea”. However, Pfu-notum RNAi failed to initiate a sim- 
ilar posterior-regeneration program. To assay alterations in gene 
expression following regeneration rescue, we used qRT-PCR to char- 
acterize expression patterns of polarity genes following Pfu-/-catenin1 
RNAi and Pfu-APC RNAi (Fig. 3h). After amputation in Pfu-f- 
catenin1(RNAi) animals, both Pfu-sFRP1 and Pfu-ndk were signifi- 
cantly upregulated, indicating that a latent anterior regeneration 
program is reactivated in Reg’ tissues following Pfu-f-catenin1 RNAi. 

Our data provide important clues about mechanisms regulating 
regeneration. In a related Dendrocoelid species with reduced regen- 
erative capacity, anterior regeneration ability in posterior fragments 
was rescued through grafting irradiated anterior tissue onto Reg” 
tissue”, suggesting that signals from differentiated anterior cells were 
sufficient to allow regeneration. Our results indicate that such signals 
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Figure 3 | Disruption of Wnt signalling via RNAi rescues regeneration in 

Reg tissues. a—c, Posterior fragments 14 days after amputation in Reg regions 
of control RNAi, Pfu--catenin1(RNAi), and Pfu-APC(RNAi) animals. Black 

arrowheads, gut branches indicative of posterior fate; dashed lines, amputation 
planes; white arrows, eyespots. d~g, Nervous system in uncut and Reg” body 
regions of control RNAi, Pfu-/-catenin1(RNAi), and Pfu-APC(RNAi) animals 
28 days after amputation. Anterior is to the left. Scale bars, 250 um. h, Relative 


are involved in re-establishing anterior—posterior polarity. Recent 
efforts have identified many requirements for the regenerative res- 
ponse to wounding, including proliferation’, apoptosis® and cell sig- 
nalling'’’*', but the interrelationships between these processes are 
not yet well understood. Our data confirm that increased neoblast 
proliferation following amputation occurs independently of the re- 
establishment of anterior—posterior polarity (Fig. le, fand Supplemen- 
tary Fig. 2)'*. We have also shown that subsequent regenerative 
processes are inhibited until polarity is re-established. These observa- 
tions suggest a checkpoint in the regeneration program that must be 
satisfied before downstream developmental processes can occur. Such 
a checkpoint would act as a vulnerable stage at which evolutionary 
modifications could alter regenerative potential. 

Loss of regenerative ability would seem to carry a selective dis- 
advantage, and possible causes for such losses have been the subject 
of much speculation’». Identification of aberrant Wnt signalling fol- 
lowing amputation in Reg tissues of P. fluviatilis suggests one mech- 
anism limiting regeneration ability in planarians; however, we can only 
speculate about the driving force leading to regeneration loss. Unlike 
most planarians, P. fluviatilis is semelparous, reproducing only once, 
then dying within a single season’. Life-history studies have shown that 
semelparous species invest more in reproduction”, possibly at the 
expense of other developmental programs. Whereas highly regenerat- 
ive, iteroparous planarian species resorb reproductive structures 
during periods of starvation” and following amputation”, semelparous 
species do not resorb testes after such events***. We speculate that the 
signals maintaining the reproductive system in semelparous planarians 
may perturb the proper re-establishment of polarity after amputation, 
providing a selective advantage for later reproductive potential. As a 
consequence, anterior regeneration ability has been limited. 

Next-generation sequencing technologies and functional analyses 
facilitate research on understudied, yet biologically informative, non- 
model organisms. Expanding use of these technologies will help 
elucidate causes for limited regeneration in other animals, potentially 
identifying inhibitory signals that must be overcome to elicit a regen- 
erative response after wounding. Given that perturbation of a single 
gene’s function can rescue an entire regenerative program, identifying 
additional regeneration-inhibiting signals will increase our understand- 
ing of the evolution of regeneration loss and provide the intriguing 
prospect of restoring regenerative abilities in regeneration-deficient 
animals. 
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transcript levels in Reg” tissue from Pfu-f-catenin1(RNAi), Pfu-APC(RNAi) 
and control RNAi animals 48 h after amputation. Significant differences 
between control and experimental samples based on P values (<0.05) from 
two-tailed unpaired t-test are shown. (*Pfu-[-catenin1(RNAi) versus control 
RNAi; **Pfu-APC(RNAi) versus control RNAi). Error bars represent 
standard deviations. 


METHODS SUMMARY 


In situ hybridization. In situ hybridizations were performed using formaldehyde 
fixation as described previously~* with minor modifications. 

Immunostaining. Immunostaining was conducted as described previously” using 
anti-synapsin mouse monoclonal, anti-phospho-histone H3 (Ser10) rabbit mono- 
clonal, goat anti-mouse Alexa-488, and goat anti-rabbit Alexa-488 antibodies. 
Histology and scanning electron microscopy. Samples were initially prepared as 
described previously**. For histology, samples were sectioned and stained with 
toluidine blue. For scanning electron microscopy, samples were mounted on 
aluminium stubs and coated with Au/Pd. 

Transcriptome sequencing and RNA-seq. Roche 454 pyrosequencing techno- 
logy was used to sequence a reference transcriptome from the RNA of four intact 
P. fluviatilis. Reads were assembled de novo using iterations of SeqMan NGen and 
CLC Genomics. Solexa Illumina sequencing was performed on RNA extracted 
from excised tissues. Reads were mapped to the reference transcriptome using 
CLC Genomics as described previously”. 

Cloning. Candidate genes were amplified via PCR from complementary DNA 
(Supplementary Table 3) and homology was assigned (Supplementary Fig. 4). 
PCR products were ligated with Eam1105I-digested pJC53.2 (ref. 30) and used 
to transform DH5« Escherichia coli cells. To generate riboprobes, in vitro trans- 
cription reactions were performed with the appropriate RNA polymerase using 
standard approaches and the addition of digoxigenin-11-UTP. 

RNAi. dsRNA was synthesized via in vitro transcription using amplified templates 
(Supplementary Fig. 5). dsRNA solution was mixed with dye and microinjected 
into the gut three times over the course of 1 week before amputation. As a negative 
control, animals were injected with dsRNA synthesized from the ccdB- and camR- 
containing insert of pJC53.2 (ref. 30). 

Quantitative RT-PCR. cDNA was synthesized from RNA isolated as described 
for RNA-seq and used to conduct quantitative PCR. Three biological replicates 
were performed. Absolute quantities were determined for each gene and normalized 
to the endogenous level of Pfu-actin. The mean value for each treatment was then 
normalized to control conditions to determine the relative changes in expression. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 

Animal collection and culture. Procotyla fluviatilis was collected from streams at 
Blockhouse Point Conservation Park (Maryland, USA) or the LaRue Pine Hills/ 
Otter Pond Research Natural Area (Illinois, USA). Animals were maintained in 
the laboratory at 18°C in Montjuich salts* and fed bi-weekly. Planarians were 
starved for 1 week before use. 

In situhybridization. In situ hybridizations were performed using the formaldehyde- 
based fixation procedure as described previously” with the following modifications. 
Planarians were killed in 8% n-acetyl cysteine for 5 min, fixed in 4% formaldehyde 
in PBSTX (PBS + 0.3% Triton X-100) for 25 min, and incubated in reduction 
solution for 5 min at room temperature. Samples were bleached in 6% H2O; for 1h. 
Samples were imaged with a Leica M205A stereoscope. 

Immunostaining. Immunostaining was conducted as described previously” 
using methacarn fixative. Primary antibody incubation was performed overnight 
at 4 °C at the following concentrations: anti-synapsin mouse monoclonal antibody 
(1:75; Developmental Studies Hybridoma Bank, catalogue no. 3C11), anti- 
phospho-histone H3 (Serl0) rabbit monoclonal antibody (1:1,000; Cell 
Signaling Technology, catalogue no. 3377). Animals were incubated in secondary 
antibody (goat anti-mouse Alexa-488, 1:400 or goat anti-rabbit, Alexa-488, 1:500; 
Molecular Probes, Invitrogen, catalogue no. A11029) overnight at 4°C. Animals 
were mounted in Vectashield. Images were captured with Zeiss SteREO 
LumarV12, Zeiss AxioZoomV16, and Zeiss LSM710 confocal microscopes. 
Histology and scanning electron microscopy. Animals were fixed and prepared 
as previously described’*. For histology, ethanol was gradually replaced with 
acetone, followed by infiltration with Araldite/Embed 812 (Electron Microscopy 
Sciences). Sections (1 1m) were collected on glass slides, dried over a heating plate, 
and stained with 1% toluidine blue for 10 s. Slides were mounted with Cytoseal 60 
(Thermo Scientific) and imaged on a Nikon Eclipse TE200 inverted microscope. 
For scanning electron microscopy, once the samples reached 100% ethanol, they 
were critical-point dried using a Tousimis Samdri-PVT-3D, mounted on 
aluminium stubs, coated with Au/Pd using a Denton Desk II TSC turbo-pumped 
sputter coater and imaged on a Philips XL30 ESEM-FEG. 

Transcriptome sequencing and RNA-seq. For sequencing the reference tran- 
scriptome, RNA from four random, intact P. fluviatilis adults was isolated using 
TRIzol (Invitrogen), DNase-treated, purified with an RNA Clean and Concentrator 
kit (Zymo Research), and submitted to the W. M. Keck Center for Comparative 
and Functional Genomics for Roche 454 pyrosequencing. Reads were assembled 
de novo using iterations of SeqMan NGen (DNASTAR) and CLC Genomics 
(CLCbio). For RNA-seq experiments, RNA was isolated from tissue fragments 
of five worms, each excised ~2 mm posterior to the amputation sites. Control RNA 
was purified from corresponding control fragments excised from intact animals at 
equivalent body regions as described above. Samples were submitted to the W. M. 
Keck Center for Comparative and Functional Genomics for Illumina sequencing. 
Reads were mapped to the reference transcriptome using CLC Genomics and compared 
as in ref. 29. Contigs with = 2-fold change and P value < 0.05 (from two-sided 
unpaired f-tests) were used for detailed analysis. Selected contigs were screened to 
identify redundant contigs by using BLAST homology to the NCBI nr database to 
merge contigs with top hits to the same gene. 
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Cloning. To generate riboprobes, candidate genes were PCR amplified from 
cDNA generated from total RNA (iScript cDNA Synthesis Kit, Bio-Rad). For 
cDNA preparations, RNA was extracted using TRIzol Reagent (Invitrogen). For 
cloning, 2-3 ul of PCR product was ligated with 70ng of Eam1105I-digested 
pJC53.2 (Rapid DNA Ligation kit, Roche)*® and used to transform DH5a 
Escherichia coli cells. In vitro transcriptions with the appropriate RNA polymerase 
were performed using standard approaches with the addition of digoxigenin-11- 
UTP (Roche). All primers used to amplify candidate genes are included in 
Supplementary Table 3. 

RNAi. To generate dsRNA, templates cloned into pJC53.2 (ref. 30) were amplified 
with a modified T7 oligonucleotide (GGATCCTAATACGACTCACTATAGGG), 
purified using a DNA Clean & Concentrator kit (Zymo Research), and eluted in 
15 pl of water. 10.5 pl of each PCR product (Supplementary Fig. 5) was used as 
template for in vitro transcription in a reaction containing 5 pl 100 mM mix of 
ribonucleotide triphosphates (rNTPs) (Promega), 1 pl high-yield transcription 
buffer (0.4M Tris pH 8.0, 0.1M MgCl, 20mM spermidine, 0.1M DTT), 1 pl 
thermostable inorganic pyrophosphatase (New England Biolabs), 0.5 ,l Optizyme 
recombinant RNase inhibitor (Fisher Scientific), and 2 1 T7 RNA polymerase. 
Samples were incubated at 37 °C for 12 h and then treated with RNase-free DNase 
(Fisher Scientific, catalogue number FP2231) and cleaned/concentrated via ammo- 
nium acetate precipitation. Synthesized RNA was then annealed by heating at 
95°C, 75°C and 50 °C each for 3 min. dsRNA solution was mixed with dye and 
65nl (~1 pg pl ') was microinjected into the gut of randomized adult planarians 
three times over the course of 1 week before amputation using a Nanoject II 
micromanipulator (Drummond Scientific). As a negative control, animals were 
injected with dsRNA synthesized from the ccdB- and camR-containing insert of 
pJC53.2 (ref. 30). Live (RNAi) animals were imaged and videos were captured with 
a Leica M205A. 

Quantitative RT-PCR. To examine transcript levels following amputation in 
regeneration-proficient and regeneration-deficient tissues, RNA was extracted 
using TRIzol reagent from five random tissue fragments identical to those used 
for RNA sequencing. Following DNase treatment (DNA-free RNA Kit, Zymo 
Research), reverse transcription was performed (iScript cDNA Synthesis Kit) 
and quantitative PCR was conducted using Power SYBR Green PCR Master 
Mix (Applied Biosystems) and a 7900HT real-time PCR system (Applied 
Biosystems). Three biological replicates were performed and all samples were 
measured in triplicate to account for pipetting error. Absolute quantities of each 
transcript were determined for each gene and normalized to the level of Pfu-actin 
in each sample. The mean value for each amputated treatment was then normal- 
ized to the intact tissue fragments extracted from an identical axial position to 
determine the relative changes in expression due to amputation. For qRT-PCR of 
genes following RNAi experiments, the mean values of control RNAi and experi- 
mental (RNAi) samples were graphed independently without normalization to 
intact fragments. All primers used for these studies are included in Supplementary 
Table 3. 


31.  Cebria, F. & Newmark, P. A. Planarian homologues of netrin and netrin receptor 
are required for proper regeneration of the central nervous system and the 
maintenance of nervous system architecture. Development 132, 3691-3703 
(2005). 
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Reactivating head regrowth in a 
regeneration-deficient planarian species 


S.-Y. Liu’, C. Selck?, B. Friedrich’, R. Lutz”, M. Vila-Farré!, A. Dahl’, H. Brandl', N. Lakshmanaperumal., I. Henry! & J.C. Rink!? 


Species capable of regenerating lost body parts occur throughout 
the animal kingdom, yet close relatives are often regeneration 
incompetent’”. Why in the face of ‘survival of the fittest’ some 
animals regenerate but others do not remains a fascinating 
question’. Planarian flatworms are well known and studied for 
their ability to regenerate from minute tissue pieces, yet species 
with limited regeneration abilities have been described even 
amongst planarians*. Here we report the characterization of the 
regeneration defect in the planarian Dendrocoelum Jacteum and 
its successful rescue. Tissue fragments cut from the posterior half 
of the body of this species are unable to regenerate a head and 
ultimately die’. We find that this defect originates during the early 
stages of head specification, which require inhibition of canonical 
Wnt signalling in other planarian species**. Notably, RNA inter- 
ference (RNAi)-mediated knockdown of Dlac-f-catenin-1, the 
Wnt signal transducer, restored the regeneration of fully func- 
tional heads on tail pieces, rescuing D. lacteum’s regeneration 
defect. Our results demonstrate the utility of comparative studies 
towards the reactivation of regenerative abilities in regeneration- 
deficient animals. Furthermore, the availability of D. lacteum 
as a regeneration-impaired planarian model species provides a 
first step towards elucidating the evolutionary mechanisms that 
ultimately determine why some animals regenerate and others 
do not. 
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The two current model planarian species, Dugesia japonica’? and 
Schmidtea mediterranea '°, were selected precisely because of their 
excellent regenerative abilities. Others amongst the many hundred 
planarian species worldwide” are regeneration deficient. We decided 
to address the mechanistic basis of such inter-species variation by 
example of the species D. lacteum, which has been reported to be 
incapable of regenerating heads from their posterior body half**. We 
found a D. lacteum population near Nottingham, UK, that adapted 
well to laboratory culture conditions and could be expanded via 
rounds of anterior amputations. Assaying our strain’s regenerative 
ability along the anterior—posterior axis confirmed lack of head regen- 
eration from tail pieces (Fig. 1a) in specific contrast to the global head 
regeneration ability of S. mediterranea (Fig. 1b). Previous character- 
izations of the D. lacteum regeneration defect were limited by the 
methods available at the time*'*"*. Intermittently obtained molecular 
insights in the planarian model species suggested a comparative 
approach towards the molecular diagnosis of D. lacteum’s regenera- 
tion defect. To enable the identification and analysis of marker genes in 
D. lacteum and to obtain a mapping reference for RNA-sequencing 
(RNA-seq) experiments, we de novo assembled a D. lacteum transcrip- 
tome (Supplementary Fig. 1 and ref. 15). Our transcriptome provides a 
representative overview of the species’ protein coding genes, as indi- 
cated by the presence of orthologues to all 248 core eukaryotic genes'® 
(Supplementary Fig. 2) or to all Wnt ligands and Fzd receptors known 


Figure 1 | D. Jacteum cannot regenerate heads 
posteriorly. a, b, Transverse sections assaying 
regenerative abilities along D. lacteum (a) or 

S. mediterranea (b) anterior—posterior axis. 
Coloured lines indicate cutting planes, resulting 
fragments are shown 14 d.p.a. Graphs quantify 
head regeneration by timing of eye appearance in 
cohorts of 10 fragments, colour-coding as for 
cutting planes. c, Head marker Dindk expression. 
d, Tail marker Dlac-wnt11-2 expression. e, Brain/ 
central nervous system (anti-synapsin, green) and 
mitoses (anti-H3P, red) immunostaining. 

0 Representative examples of uncut D. lacteum 
(left), head-regenerating trunk (top right) or 
regeneration-impaired tail pieces (bottom right) at 
14 d.p.a., n as indicated. Asterisk, pharynx. Scale 
bars, 500 tum. 
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from S. mediterranea (Supplementary Fig. 3a, b). Furthermore, we 
frequently find multiple distinct D. lacteum transcripts orthologous 
to a given S. mediterranea gene (for example, Supplementary Fig. 3c), 
indicating either high gene-specific haplotype divergence rates or mul- 
tiple gene duplications in D. lacteum. Cloning and expression analysis 
of a planarian head marker’’ (Dlac-ndk) (Fig. 1c) and a tail marker'® 
(Dlac-wnt11-2) (Fig. 1d) revealed that neither was expressed in tail 
pieces at 14 days post amputation (d.p.a.). Phospho-histone H3 (H3P) 
staining as proxy of stem cell activity’? was present all along the 
anterior—posterior axis (Fig. le), confirming previous indications that 
stem cells are equally distributed along D. lacteum’s anterior—posterior 
axis*!*""*, Despite ectopic looping of ventral nerve cords instead of 
regenerating a bi-lobed brain (Fig. le), tail fragments survived for 
more than 3 months without ever displaying any overt signs of head 
regeneration (Supplementary Fig. 4). D. lacteum’s posterior head 
regeneration inability therefore probably originated from a specific 
defect in regeneration, rather than a kinetic delay. 

Regeneration is a multistep process, entailing (1) appropriate 
wound responses, (2) cell proliferation/blastema formation and (3) 
blastema fate specification”. To delimit which of these processes might 
be compromised, we carried out detailed RNA-seq time-course experi- 
ments of head-regenerating wounds in the anterior body half (‘trunk’) 
versus non-regenerating wounds in the posterior body half (‘tail’) 
(Fig. 2a)'*. A recent S. mediterranea study identified planarian 
early wound response genes*'. All scoreable D. lacteum orthologues’® 
showed remarkably consistent expression kinetics at trunk and tail 
wounds, indicating that D. lacteum tail pieces have a functional wound 
response (Fig. 2a). To assess whether blastema formation might be 
defective, we compared the accumulation of Dlac-piwi-1-expressing 
stem cells at wound sites and their mitotic activity’. As shown in 
Fig. 2b, the accumulation of Dlac-piwil™ cells and elevation of H3P* 
cell numbers occurred to comparable extents at trunk and tail wounds 
during the first 3 d.p.a. Histological sections through the wound area at 
day 3 of regeneration revealed a gap in the basement membrane and 
the accumulation of small, undifferentiated cells (Fig. 2c), confirming 
successful blastema formation. Regeneration of the body edge, as 
assayed by in situ hybridization with the D. lacteum orthologue of 
Smed-laminB”, further suggested that tail-piece blastemas are func- 
tional and capable of replacing lost cell types (Supplementary Fig. 5). 
Thus, in agreement with previous studies'*""*, these data demonstrate 
that the D. lacteum tail regeneration defect is not due to defective 
blastema formation. 

To explore the remaining possibility of aberrant blastema fate 
specification, we systematically analysed the expression kinetics of a 
library of head and tail marker genes (Supplementary Table 1). Trunk 
wounds showed the expected upregulation of head markers above 
their low baseline expression at the amputation level (Fig. 3a, top), 
even though response kinetic differed from similar experiments in 
S. mediterranea (Supplementary Fig. 6). In situ hybridization (Sup- 
plementary Fig. 7) and independent quantitative PCR (Supplemen- 
tary Fig. 8) confirmed the RNA-seq measurements. Remarkably, tail 
wounds failed to induce any of the head markers examined, not 
even the very early anterior fate markers notum”’ or sFRP1 (ref. 6) 
(Fig. 3a, bottom). Concomitantly, D. lacteum tail pieces failed to down- 
regulate markers of tail identity (Supplementary Table 1), again in 
sharp contrast to the rapid and sustained downregulation of these 
genes in head-regenerating S. mediterranea tail pieces cut at the same 
level (Fig. 3b). These results therefore demonstrate a twofold fate 
specification defect at D. lacteum tail wounds, (1) complete failure to 
establish anterior/head identity and (2) inappropriately maintained 
tail identity. 

In planarians, head and tail identity are mutually exclusive due to 
opposite dependence on canonical Wnt signalling/B-catenin. Inhibi- 
tion of B-catenin is necessary and sufficient for head induction, 
whereas activation of B-catenin is necessary and sufficient for tail 
specification® *. Failed head establishment under sustained tail marker 
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Figure 2 | Posterior D. lacteum wounds form a blastema. a, RNA-seq time- 
course analysis of wound response in trunk or tail. Heat map of log, fold- 
change relative to fy expression levels of the indicated genes. b, Accumulation of 
Dlac-piwil-expressing cells (red) at wounds of representative trunk or tail 
fragments (n = 4 per cohort) at indicated d.p.a., DAPI counterstaining (blue). 
Anti-H3P immunostaining (green, right) labels mitoses. Images are maximum 
projected confocal Z-stacks. Scale bar, 200 um. c, Representative haematoxylin/ 
eosin sagittal sections through trunk and tail wounds, 3 d.p.a. (n = 2). BL, 
blastema; BM, basement membrane; d/vEP, dorsal/ventral epithelium. Scale 
bar, 50 um. 


expression could therefore result both from inappropriately high levels 
of Wnt signalling and/or from insufficient Wnt inhibitory capacity. 
We adapted the double-stranded RNA (dsRNA) injection protocol 
from the planarian model species to functionally address these 
possibilities. Simultaneous targeting of the three Wnt ligands failing 
to downregulate in D. lacteum tail wounds (Fig. 3b) did not rescue 
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a D. lacteum trunk b D. lacteum tail Figure 3 | Posterior D. Jacteum wounds cannot 
20 re-specify tissue identity. a, RNA-seq time course 
20/ of head marker genes in regenerating trunk (top) 
15 s and regeneration-impaired tail wounds (bottom). 
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head regeneration, but interfered with tail formation at posterior- 
facing wounds (Supplementary Fig. 9). However, increasing pathway 
activity via targeting of the destruction complex member Dlac- 
APCinduced tail regeneration (Supplementary Fig. 10), indicating that 
Wnt signalling levels were not constitutively posteriorizing at the 
amputation level and that at least the three tested Wnt ligands were 
not the cause of the head fate initiation failure. To test a possible 
contribution of insufficient Wnt-pathway inhibition to D. lacteum’s 
head regeneration defect, we specifically downregulated Dlac-f- 
catenin1 (Supplementary Fig. 11). dsRNA injections induced ectopic 
heads on trunk pieces (Supplementary Fig. 12), but, remarkably, also 
rescued head regeneration in 16 out of 24 tail pieces (Fig. 4a). All heads 
scored as rescued were morphologically and behaviourally indis- 
tinguishable from heads regenerated at trunk amputations. Where 
examined, specimens showed normal head marker Dlac-ndk expres- 
sion and morphologically normal bi-lobed brains (Fig. 4b). In contrast 
to headless controls, Dlac-f-catenin1(RNAi) tail pieces were able to 
feed (Fig. 4c), demonstrating the functionality of the heads. Rescued 
tail fragments at 3 months post amputation further showed the same 
anterior—posterior gradation of regenerative abilities as in control 
animals, demonstrating the rescue of global body plan patterning 
(Fig. 4d). 

Our results therefore demonstrate that D. lacteum is not defective in 
regeneration per se, but in the choice of blastema identity. The outcome 
of D. lacteum tail-piece regeneration, that is, newly formed tissue 
devoid of anterior or posterior identity, looping of ventral nerve cords 
(Fig. 1) and expression of boundary marker (Supplementary Fig. 5) are 
strongly reminiscent of the “headless” phenotype previously described 
in S. mediterranea*”*. Jointly, these defects indicate a hierarchical 
structure of blastema fate specification, whereby the initiation of axial 
pole regeneration (head or tail)”* requires further signals beyond those 
required for mere ‘repair’ of lateral wounds. Our data and previous 
results emphasize the central importance of f-catenin at this step, 
signifying head formation when inhibited and tail specification when 
activated. By failing to trigger the B-catenin switch, D. lacteum tail 
pieces fail to initiate pole regeneration and only carry out local damage 
repair. Interestingly, we found occasional D. lacteum escapees from 
such molecular deadlock that regenerated bona fide tails (Sup- 
plementary Fig. 13). Although consistent with a lack of Wnt inhibi- 
tion in D. lacteum tail pieces (Fig. 3b), the low proportion of 
escapees emphasizes that this alone is not sufficient for robust tail fate 
establishment. 
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What prevents the binary choice capability of the B-catenin system 
from activating in D. lacteum tail pieces (or during normal damage 
repair) remains an open question. One possible explanation is the failed 
upregulation of the Wnt inhibitor notum in D. lacteum tails (Fig. 3a, 
Supplementary Fig. 14), which is critical in S. mediterranea head spe- 
cification”*. A further interesting observation is the greatly reduced 
induction of wntl during the D. lacteum early wounding response 
(Supplementary Fig. 14). In S. mediterranea, wnt1 expression increases 
rapidly and strongly at any wound’*”’ and contributes, at anterior 


Control 


Rescue (Dlac-$-catenin1 (RNAi) 


Figure 4 | Rescue of the D. lacteum regeneration defect by f-catenin1 
(RNAi). a, Live images of a representative control (top) or S-catenin1(RNAi) 
tail piece (bottom) at indicated d.p.a. Rescue was observed in 16 out of 24 cases 
from two independent experiments. b, Rescued head marker expression 
(Dlac-ndk, left) and brain/central nervous system (anti-synapsin staining, 
right) in B-catenin1(RNAi) tail pieces (n = 3/3). ¢, B-catenin1(RNAi) tail pieces 
(left), but not controls (right), ingest phenol-red-labelled food (n = 2/2). 

d, Restoration of graded regeneration competence in rescued tail piece 

(n = 2/2). Fragments in b-d were at 3 months post amputation. Scale bars, 
500 um. 
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facing wounds only, to notum upregulation as part of a negative feed- 
back loop”. The functional significance of the polarity-independent 
Wntl response remains unclear, but our observations raise the 
interesting possibility that a Wnt signalling burst early in regeneration 
might prime the f-catenin system for subsequent fate choices, thus 
contributing to the robustness of regeneration in species that have it. 

Overall, the fact that the global inhibition of a signalling pathway 
can reverse a species-specific regeneration defect remains astonishing. 
This result shows that rebuilding complex body parts may not always 
necessitate understanding and manipulation of myriad individual 
signals, but possibly a small number of network nodes only. 
Reactivating regeneration in species that, unlike D. lacteum, have 
entirely lost the ability, certainly constitutes a significantly greater 
challenge. However, our results provide a proof of principle towards 
the utility of comparative approaches in understanding and potentially 
rescuing species-specific regeneration defects. 


METHODS SUMMARY 

Transcriptome assembly, RNA-seq and cloning. The Dresden D. lacteum 
(ddDlac) transcriptome was de novo assembled using Trinity’? and comprises a 
total of 49,394 transcripts, with the half-point of the contig length distribution 
(N50) at 1,570 base pairs. For RNA-seq time courses, single replicates of poly-A 
selected RNA from dissected wound areas of 8 fragments/time point were 
sequenced to a depth of 20-30 million reads per time point and mapped to the 
ddDlac assembly’®. For wound response analysis, D. lacteum orthologues of known 
S. mediterranea wound-response genes” were identified using reciprocal BLAST 
against the ddDlac assembly. All orthologues displaying > twofold upregulation for 
at least one time point and > 0 aligned reads at the zero time point were analysed. 
For marker analysis, D. lacteum transcripts were PCR-amplified from D. lacteum 
cDNA and cloned into pPR-T4P™. For primers, see Supplementary Table 2. 

In situ hybridization, immunohistochemistry and histology. For details on in 
situ hybridizations, see ref. 15. Immunocytochemistry was done on post in situ 
hybridization animals, using rabbit anti-H3P (1:250, Millipore), mouse -synapsin 
(3C11, 1:10, Hybridoma Bank) and Alexa Fluor secondary antibodies. Histology 
specimens were killed and fixed with Bouin’s solution, transferred to 70% ethanol 
and cleared in clove oil before paraffin embedding. 5 j1m sections were attached to 
gelatine-coated slides, stained in haematoxylin/eosin and mounted in DPX. All 
wide field images were taken on an IFMZ100 microscope (Nikon) equipped with a 
colour camera; confocal images on a Zeiss LSM700. 

Animal husbandry, RNAi and amputations. D. lacteum were maintained at 
20°C in a defined salt solution and expanded using prepharyngeal bisection”, 
using specimens of < 12 mm length for experiments. Double-stranded RNA for 
injections were synthesized using templates of 750 bp PCR products with added 
T7 promoter sequences’. For primers, see Supplementary Table 2. dsRNA was 
injected into cold-immobilized D. lacteum three times every second day at 
4g pl! final concentration before amputation 3 days post last injection. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


RNA-seq. Total RNA was isolated using the TRIzol (Invitrogen) protocol 
described in the online supplement of ref. 30, quality controlled on a 
Bioanalyzer (Agilent), poly-A selected using Sera-Mag Oligo(dT) beads 
(Thermo Scientific) and processed for Illumina Sequencing (HiSeq 2000). 
Pooled intact and day 1-6 regenerating D. lacteum served as RNA source for 
~400 million 100-bp paired-end reads; dissected wound areas at different regen- 
eration stages (below) for the ~ 550 million 76-bp single-end reads. 

Transcriptome assembly. The R2 mate pairs of paired-end reads were trimmed to 
65 bp from 3’ to 5’ with the FASTX Trimmer (Toolkit version 0.0.13) due to poor 
sequence quality. All paired-end reads were further filtered with Cutadapt (version 
0.9.5) and low-complexity reads were removed with a customized script. The 
remaining 354 million high-quality paired-end reads and 521 million 76-bp 
single-end reads from time-course experiments (below) were processed with 
Trinity”, version trinityrnaseq_r2012-06-08, and assembled using default settings. 

The 189,820 transcripts of the primary assembly were corrected for strand 
orientation using the directional bias in the single-end fraction of input reads. 
Directional reads were aligned against the assembly using Bowtie2* (version 
2.1.0), and all transcripts with >70% of aligning reads in the forward direction 
(5’ to 3’) were reverse-complemented. Second, to correct for a class of assembly 
artefacts resulting in frame shifts, we re-mapped the input reads against the 
assembly, called putative insertions/deletions (InDels) using SAMtools mpileup”’, 
filtered for InDel positions with an alternate allele frequency of greater than 75% 
and a minimum coverage of 8, and adjusted the resulting transcript sequences 
according to the replacement pattern suggested by SAMtools. 

The primary transcriptome was annotated using the following tools: (1) Getorf 
from the EMBOSS tools suite** (v6.4) was used to detect open reading frames 
(ORE) longer than 100 nucleotides, allowing ORFs spanning one or both tran- 
script bounds to not exclude fragmented transcripts. (2) Domains were predicted 
using InterProScan** 5 RC4 for all ORFs containing at least 40 amino acids. (3) 
Transcripts were searched against the RefSeq protein database, using BLASTX*° 
(v2.2.27) with an e-value cutoff of 0.0001 and allowing for maximum 100 hits per 
transcript. Only high-scoring segment pairs (HSPs) with a query and subject 
coverage of at least 40% and a per cent similarity score of 40 or greater were 
retained for annotation purposes. 

To generate an assembly subset enriched in cDNAs, we filtered the primary 
assembly on basis of conforming to any one of the following criteria: (1) ORF 
length longer than 225 nucleotides (75 amino acids); (2) presence of at least one 
predicted domain or (3) BLASTX homology to a RefSeq entry. In case of multiple 
transcripts variants, only the longest graph component was retained. The resulting 
Dresden D. lacteum (ddDlac) assembly comprises a total of 49,394 transcripts, 
with the half-point of the contig length distribution (N50) at 1,570 base pairs (see 
Supplementary Fig. 1). 

RNA-seq time course experiments. 8 fragments per time point (0, 4, 12, 16, 24, 
48, 72 and 120h post amputation) were killed in 1% HCl for 1 min, washed with 
PBS 2x, transferred into RNAlater (Life Technologies) before dissection of the 
immediate wound area on a cold plate. Dissected tissue was transferred into 
TRIzol (Invitrogen) and RNA extraction/sequencing was carried out as above. 
Each time point was sequenced at a depth of 20-30 million single-end 76 bp reads, 
post-processed and aligned to the ddDlac assembly with bowtie** (version 0.12.7) 
using -ml -v2 options. The number of aligned reads/transcript were converted 
to RPKM (reads per kilobase per million mapped reads), and, for fold-change 
normalization, divided by the RPKM value at the zero time point. Data analysis 
and graphing were done in R (version 2.15.3). 

Wound response analysis. To compare the D. lacteum wound response to the 
published S. mediterranea data set”', the S. mediterranea wound response gene 
sequences were used to identify their D. lacteum orthologues using reciprocal 
BLAST against the ddDlac assembly. All orthologues showing > twofold 


LETTER 


upregulation for at least one time point and > 0 mapped reads at the zero 
time point were included in the analysis. 

Gene identification and cloning. D. lacteum orthologues of planarian marker 
genes were identified by reciprocal BLAST for S. mediterranea, Schmidtea 
polychroa or Dugesia japonica transcripts. CDNA fragments were PCR-amplified 
from D. lacteum cDNA and cloned into pPR-T4P**. For primers, see Sup- 
plementary Table 2. 

In situ hybridization, immunohistochemistry and histology. In situ hybridiza- 
tions were carried out essentially as described”’, except that animals were killed in 
500 mM MgCl, for 90 s before N-acetyl cysteine treatment and the reduction time 
was extended to 20 min. Immunocytochemistry was carried out on post in situ 
hybridization animals, using monoclonal rabbit anti-H3P (1:250, Millipore), 
monoclonal mouse anti-synapsin (3C11, 1:10, Hybridoma Bank) and Alexa 
Fluor-conjugated secondary antibodies raised in goat. For histology, the 
specimens were killed and fixed with Bouin’s solution and stored in 70% ethanol. 
Fixed specimens were cleared in clove oil before paraffin wax embedding, section- 
ing (541m) and mounting on gelatine-coated slides. Sections were stained in 
haematoxylin/eosin and mounted in DPX. All wide-field images were taken on 
a IFMZ100 microscope (Nikon), equipped with a Ds-Fil colour camera (Nikon). 
Z-stacks were acquired on a Zeiss LSM700 confocal and maximum-projected 
using Fiji*. 

Animal husbandry, RNAi and amputations. D. lacteum were maintained at 
20°C as described and expanded using prepharyngeal bisection. Double- 
stranded RNA against Dlac-$-catenin1, Dlac-APC, Dlac-wnt11-1, -2, -5 and a 
planarian codon usage optimized mCherry sequence (negative control) were 
synthesized using 750 bp PCR products with added T7 promoter sequences as 
template (see Supplementary Table 2 for primer sequences). dsRNA synthesis was 
carried out as described*’, but omitting the RNase III digestion. dsRNA was 
resuspended in RNase free water at a concentration of 4 1g pil’ and injected into 
cold-immobilized D. lacteum three times every second day. Injected animals were 
amputated 72h after the last injection. 

qPCR. RNA was extracted as described above and cDNA synthesis was done 
with the SuperScript III First-Strand Synthesis System (Invitrogen). qPCR was 
performed with Absolute QPCR SYBR Green Mix Plus ROX system (Thermo 
Scientific) according to the manufacturer's recommendations. See Supplementary 
Table 2 for primer information. 
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Dual-mode operation of neuronal networks involved 


in left-right alternation 


Adolfo E. Talpalar’*, Julien Bouvier’, Lotta Borgius', Gilles Fortin’, Alessandra Pierani? & Ole Kiehn! 


All forms of locomotion are repetitive motor activities that require 
coordinated bilateral activation of muscles. The executive elements 
of locomotor control are networks of spinal neurons that determine 
gait pattern through the sequential activation of motor-neuron pools 
on either side of the body axis’ *. However, little is known about the 
constraints that link left-right coordination to locomotor speed. 
Recent advances have indicated that both excitatory and inhibitory 
commissural neurons may be involved in left-right coordination*”. 
But the neural underpinnings of this, and a possible causal link between 
these different groups of commissural neurons and left-right alter- 
nation, are lacking. Here we show, using intersectional mouse genetics, 
that ablation of a group of transcriptionally defined commissural 
neurons—the VO population—leads to a quadrupedal hopping at all 
frequencies of locomotion. The selective ablation of inhibitory VO 
neurons leads to a lack of left-right pattern at low frequencies, 
mixed coordination at medium frequencies, and alternation at high 
locomotor frequencies. When ablation is targeted to excitatory VO 
neurons, left-right alternation is present at low frequencies, and 
hopping is restricted to medium and high locomotor frequencies. 
Therefore, the intrinsic logic of the central control of locomotion 
incorporates a modular organization, with two subgroups of VO 
neurons required for the existence of left-right alternating modes 
at different speeds of locomotion. The two molecularly distinct sets 
of commissural neurons may constrain species-related naturally 
occurring frequency-dependent coordination and be involved in 
the evolution of different gaits. 

VO neurons, characterized by the early expression of the transcrip- 
tion factor Dbx1 (refs 8, 9), represent a major class of commissural 
neurons in the ventral spinal cord, where the locomotor network is 
localized’. Previous experiments have shown that knockout of the 
Dbx1 gene leads to an intermittent disruption of, but a preserved 
possibility for, left-right alternation during drug-induced locomotion 
in vitro’. These experiments also indicated an exclusive role for inhib- 
itory commissural neurons in securing left-right alternation. Here, we 
tested in vivo and in vitro the possibility that transmitter-defined subsets 
of commissural neurons may condition left-right alternation at distinct 
locomotor frequencies using the Dbx 1!0*?-stop-T0xP-DTA (Thy PTA. DTA, 
diphtheria toxin A)"’ mouse strain that allows conditional ablation of 
V0 cells. Crossing the Dbx1?™ line with generic Cre mouse lines led 
to lethal genotypes owing to breathing impairements’*. However, 
when Dbx1?™ mice were mated with the E1Ngn2::Cre line’’, where 
Cre is expressed in the spinal cord but has limited impact on 
the respiratory centres, EINgn2::Cre;Dbx1?™ pups survived". Juvenile 
mutant animals (3-4 weeks) showed a pronounced quadrupedal hop- 
ping, with sequential synchronized lifting (phase values around 0 or 1) 
of the forelimbs followed by synchronized lifting of the hindlimbs 
(Fig. la, bottom, and Supplementary Video 1). This behaviour was 
never seen in age-matched wild-type mice at comparable frequencies 
of locomotion, in which alternating gait patterns (phase values around 
0.5) dominated (Fig. 1a, top, and Supplementary Video 2). 


The VO population is composed of dorsal (VOp) and ventral (V0y) 
neuronal sub-classes that arise respectively from the dorsal (Pax7*) and 
ventral (Pax7) subdomains of Dbx1-expressing (p0) progenitors*”. 
We found that the El enhancer of Ngn2 drives Cre expression in the 
majority of Dbx1* progenitors of both subdomains (Fig. 1b, c). 
Accordingly, EINgn2::Cre;Dbx1?™ embryos show a marked reduc- 
tion in Dbx1* progenitors and Evx1-expressing neurons, which are a 
known VOy contingent (Fig. 1d, e), whereas adjacent progenitors (pd6, 
pl and p2a) were spared (Supplementary Fig. 1). Together, these find- 
ings indicate that the quadrupedal hopping is due to a specific elimi- 
nation of both VOp and VOy neurons. 

To rule out the possibility that the E1Ngn2::Cre;DbxI-"* phenotype 
may result in part from the loss of VO neurons in supraspinal structures, 
we used isolated spinal cords from newborn mice (Fig. 1f), in which 
locomotor-like activity can be induced by the exogenous application of 
neuro-active substances such as NMDA (N-methyl-p-aspartate) and 
serotonin, or 5-hydroxytryptamine (5-HT). In such conditions, wild- 
type preparations show a typical pattern of hindlimb locomotor-like 
activity characterized by left (L)-right (R) alternation between corres- 
ponding flexor-related lumbar (L)2 roots (LL2-RL2) and extensor- 
related lumbar 5 roots (LL5-RL5), and flexor—extensor alternation 
(LL2-LL5; RL2-RL5) (Fig. 1g, top; seen as phase values around 0.5 
in circular plots in Fig. 1g, bottom). By contrast, preparations from 
E1Ngn2::Cre;Dbx1?™ animals (Fig. 1h, top) show rhythmic activity 
characterized by intrasegmental synchrony (phase values around 0 
or 1) (LL2-RL2; LL5-RL5). Flexor—extensor alternation (LL2-LL5; 
RL2-RLS5) was maintained with a larger spread of coordination around 
strict out-of-phase alternating activity as compared with wild-type 
mice (P < 0.001; Fig. 1h, bottom). Further confirmation of the spinal 
origin of the E1Ngn2::Cre;Dbx1?™ phenotype came from examining 
Hoxb8::Cre;Dbx1°?™ animals, in which Cre recombination is restricted 
to spinal segments caudal to cervical segment 4 (Supplementary Fig. 
2a-c)'’. These conditional mutants showed, as did the E1Ngn2::Cre; 
Dbx1?™ mice, a clear hopping phenotype both in vivo and in vitro 
(Supplementary Fig. 2d, e and Supplementary Video 3). The hopping 
phenotype was complete at the hindlimb level whereas episodes of 
alternation were still present in the forelimbs, reflecting the rostro- 
caudal gradient of Hoxb8 expression. Together, these data show that 
major (E1Ngn2::Cre;Dbx1 DTA) oy complete (Hoxb8::Cre;Dbx1 PES) deple- 
tion of VO neurons leads to a hopping locomotor phenotype that is due to 
changes in left-right alternating circuitries contained within the spinal 
cord. Although V0 neurons are not necessary for flexor—extensor alterna- 
tion, their elimination may affect coordination because of long-range 
commissural projections in the lumbar spinal cord’. 

When spontaneously moving with locomotor frequencies ranging 
from 2-10 Hz, juvenile wild-type mice either walk (lower end of fre- 
quencies) or trot (higher end of frequencies)'® (Fig. 2a). At higher 
locomotor frequencies (>10Hz) wild-type mice might gallop’’. In 
the same range of locomotor frequencies at which wild-type mice show 
alternating gaits (walking/trotting), V0-ablated mice consistently 
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showed a quadrupedal (E1 Ngn2::Cre;Dbx1?™, Fig. 2b) or hindlimb 
(Hoxb8::Cre;Dbx1?"™, Supplementary Fig. 2d) hopping gait. Similarly, 
isolated spinal cord preparations from E1Ngn2::Cre;Dbx1”"* (Fig. 2c, d) 
and Hoxb8::Cre;Dbx1?™ (Supplementary Fig. 2e) mice also produced 
hindlimb activity with left-right synchrony at all obtainable locomotor 
frequencies. These experiments show that ablation of VO neurons leads 
to loss of left-right limb alternation along the entire locomotion fre- 
quency domain at which alternation is normally present. 

Unlike in Dbx1-knockout mice", left-right alternation could not 
be restored in E1Ngn2::Cre;Dbx1?' mice by increasing the inhi- 
bitory network drive with uptake blockers of glycine and GABA (y- 
aminobutyric acid) (Supplementary Fig. 3), showing that in the 
absence of VO neurons normal alternating gait is not secured by other 
commissural pathways. These observations point to the differences 
between gene knockout strategies, in which targeted cells may acquire 
a different fate but may still be part of the network, and DTA-mediated 
ablation, which permanently removes the targeted cells. Moreover, the 
blockade of fast glycinergic and GABAergic inhibition in the presence 
of neuro-active drugs produced a left-right and flexor-extensor syn- 
chrony indistinguishable from that seen in wild-type mice (Supplemen- 
tary Fig. 4), indicating that bilateral synchronization is mediated by a 
non-V0 commissural population. 

The VO population is composed of VOy and VOp subpopulations, 
and of excitatory and inhibitory contingents", raising the question of 
how these molecularly and transmitter-defined populations are linked, 
and what their individual contributions are to left-right alternation. In 
the mouse lumbar ventral spinal cord, Pax7-derived VO0p neurons con- 
stitute two-thirds of the VO lineage and are predominantly GABAergic 
or glycinergic inhibitory neurons (Supplementary Fig. 5a—d). Evx1* 
VOy neurons constitute one-third of the VO lineage (Supplementary 
Fig. 5b, c) and largely express Vglut2, the only vesicular glutamate trans- 
porter found in ventral spinal neurons", and no inhibitory transmitter 
markers (Supplementary Fig. 5d-f). Therefore, the genetic partitioning 
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Figure 1 | Ablation of VO neurons leads to a 
hopping gait. a, Locomotor pattern in freely 
moving wild-type and EINgn2::Cre;Dbx1?™ 
mutant mice. b, Partitioning of Dbx1* progenitors 
at E10.5 into dorsal (p0p: Pax7~) and ventral (poy: 
Pax7 ) subdomains, and co-expression of 
E1Ngn2::Cre, in both genotypes. Scale bar: 50 tm. 
c, Corresponding quantifications (E1Ngn2::Cre 
(wild type; WT), N = 3; EINgn2::Cre;Dbx1?™ 
e (mutant; Mut.) N = 3, mean + standard deviation 
CINgn2- (s.d.)). d, Evx1 VOy interneurons and 
mons E1Ngn2::Cre expression in both genotypes at 
E10.5. Scale bar: 50 um. e, Corresponding 
quantifications (E1Ngn2::Cre (WT) N = 3; 
EINgn2::Cre; Dbx1?"™ (Mut.) N = 4, mean = s.d.). 
f, Ventral root recording configuration for in vitro 
experiments. g, h, Top, rectified recordings of 
locomotor-like pattern in left (L) and right (R) 
lumbar (L)2 and 5 ventral roots of wild-type and 
E1Ngn2::Cre;Dbx 1?™ mice. Bottom, 
corresponding circular plots showing the phase 
relationships LL2-RL2 (wild-type, N = 14; 
EINgn2::Cre;Dbx1?™, N = 19), LL5-RL5 (N= 9; 
N= 19) and flexor (F)—extensor (E) (N = 24; 
N=20). 
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of pO progenitors in mice leads to the production of excitatory (VO, 
one-thirds) and inhibitory (VOp, two-thirds) subgroups of VO neurons. 

We first tested the role of the inhibitory, Pax7-derived, VOp neurons 
in bilateral coordination by partnering Pax7::Cre'® with Dbx1?"™ mice. 
This cross efficiently ablated pOp progenitors (Fig. 3a), whereas the 
p0y-derived Evx1* neurons (Fig. 3a, c) and the pd6, p1 and p2 progeni- 
tors were normally produced (Fig. 3b). Pax7::Cre;Dbx1?'* mutants 
die at birth from impaired breathing”, precluding any in vivo investi- 
gations. Nevertheless, hindlimb locomotor-like activity could be induced 
in isolated spinal cords from embryonic day (E)18.5 embryos (Fig. 3d). 
Mutant preparations did not show any coordinated locomotor-like 
activity below 0.20 Hz (Fig. 3e). At medium frequencies (0.20-0.45 Hz), 
left-right alternation coexisted with synchronous activity in flexor- 
related L2 roots, whereas the extensor-related L5 roots showed syn- 
chronization. At the highest obtainable frequencies (>0.45 Hz), the 
flexor-related L2 ventral roots featured clear left-right alternation 
(Fig. 3e, middle). These trends in frequency—phase relationships were 
also seen when phase values were converted to a half-circle and 
weighted regression lines were fitted to the combined data sets (Sup- 
plementary Fig. 6a). The differences in L2 and L5 activity may reflect 
the greater robustness of left-right alternating circuits in the rostral 
compared with the caudal lumbar spinal cord’’. These experiments show 
that the inhibitory VOp neurons are needed for left-right alternation at 
low frequencies but dispensable for alternation at the highest locomotor 
frequencies. In the absence of the VOp population, the remaining com- 
missural neurons do not sustain persistent alternation at medium- 
range frequencies, leading to the presence of both intrasegmental 
alternation and synchrony. 

We next assessed the role of the excitatory V0y commissural neu- 
rons by crossing Dbx1?" with Vglut2::BAC-Cre (Vglut2::Cre) mice’’. 
Cre expression was found in one-third of Dbx1-derived V0 neurons, 
predominantly in Evx1* VOy neurons (Supplementary Fig. 5c-f). 
Accordingly, Vglut2::Cre;Dbx1?'™ animals lacked most Evxl* VOy 
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Figure 2 | V0-deleted mice hop at all locomotor frequencies. a, b, Locomotor 
gait in wild-type and E1Ngn2::Cre;Dbx1?™ mice. Top, frequency—phase 
relationship of wild-type (a; N = 3) and EINgn2::Cre;Dbx1 Ee (b; N = 4) mice. 
Bottom, phase histograms between indicated pairs of limbs. c, Ventral root 
recordings during drug-induced locomotor-like activity in postnatal day (P)0- 
Pl EINgn2::Cre;Dbx1°™ preparations at different frequencies (obtained by 
changing the concentrations of locomotor-inducing drugs). Raw ventral root 
recordings shown in black with superimposed rectified and integrated signals 
in yellow (same in Figs 3d and 4d). d, Corresponding frequency-—phase plots 
where each square represents the mean phase of a significantly oriented phase 
vector (N = 11). Error bars show angular dispersion. The different scales of grey 
indicate low- (<0.2 Hz), intermediate- (0.20-0.45 Hz) and high-frequency 
(>0.45 Hz) regimes here and in Figs 3e and 4e. 


neurons (Supplementary Fig. 7a and Fig. 4a) but there was no change 
in the number of the adjacent Di6, V2a and V1/VO0p neurons (Sup- 
plementary Fig. 7b, c). We did not exclude a possible effect on the small 
population of Dbx1-derived Pitx2-expressing neurons”. Importantly, 
the main respiratory populations were spared, allowing in vivo analysis. 
Juvenile mutant animals showed either a left-right alternating or hop- 
ping hindlimb gait in a frequency-dependent manner: an alternating 
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Figure 3 | VOp-deleted mice show a lack of alternation at low frequencies 
and alternation at high frequencies. a, Pax7-expressing dorsal Dbx1~ 
progenitors (p0p) are ablated in Pax7::Cre;Dbx1°"™ mutants (Mut.) as 
compared with wild type (WT), whereas the ventral pO (pOy, Pax7 ) neural 
progenitors and the VOy neurons expressing Evx1 are spared (N = 3 in each 
group, mean = s.d.). NS, not significant. b, The expression of BhlhbS, a marker 
for pd6, p1 and p2a progenitors at E10.5 (ref. 30), is preserved in Pax7::Cre; 
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pattern was present at low locomotor frequencies (2-4 Hz) and a hop- 
ping gait at high frequencies (4-10 Hz) (Fig. 4b, c and Supplementary 
Video 4). The forelimbs maintained left-right alternation at all speeds. 
This frequency-dependent locomotor phenotype was recapitulated 
in vitro, with locomotor-like activity showing a hindlimb ‘hopping’ 
pattern at the higher and alternation at the lower frequencies (Fig. 4d, e 
and Supplementary Fig. 6b). At intermediate frequencies, flexor-related 
L2 and extensor-related L5 ventral roots showed either alternation or 
synchrony (Fig. 4d, e). Flexor—extensor alternation was close to normal 
at all frequencies. Blockade of fast inhibition in these animals resulted 
in bilateral synchronization (Supplementary Fig. 4c). Thus, glutama- 
tergic VO neurons are necessary for hindlimb alternation at high loco- 
motor frequencies but dispensable at low locomotor frequencies. By 
contrast, at the forelimb level, VOy neurons are dispensable for alterna- 
tion at all frequencies, suggesting that VOp neurons may be the domi- 
nant alternating system in the forelimb locomotor network. 

Using both in vivo and in vitro experiments, we have identified VO 
commissural neurons as necessary for alternating gaits in mice. Together 
with previous studies on Dbx1-independent commissural neurons”, 
our study points to a role for distinct commissural pathways in ensuring 
different modes of left-right locomotor coordination. VO neurons seem 
to be the major executors necessary for left-right alternating gaits, 
whereas other non-V0 commissural neurons promote synchrony and 
presumably secure galloping at high speeds of locomotion in intact mice. 

The VO alternating network relies on distinct molecularly defined 
populations that enforce alternation at the low and high ends of the 
alternating locomotor frequency domain. Studies in newborn rodents*” 
have revealed that contralateral inhibition of motor neurons can be 
mediated both directly, by inhibitory commissural neurons, and indirectly, 
via excitatory commissural neurons acting on local inhibitory inter- 
neurons. The present study elucidates the genetic underpinning and 
shows the functional expression of this dual organization of the altern- 
ating locomotor network, and opposes the generally held notion that 
left-right alternation is mediated by inhibitory commissural neurons 
alone’'*”*. The implication is that the configuration of the left-right 
alternating locomotor network changes with speed. The inhibitory 
VOp neurons are recruited first, during slow locomotion, followed by 
excitatory VOy neurons recruited at higher frequencies. It is conceiv- 
able that at intermediate frequencies of locomotion the two systems are 
active simultaneously. Such speed-dependent changes in recruitment 
order have been described in zebrafish for classes of excitatory inter- 
neurons driving motor neurons during swimming movements™*”». 
The network switch described here is in accord with findings in mice 
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Dbx1?"™ mutant mice (wild type, N = 4; Pax7::Cre;Dbx1?"™, N = 4; 

mean + s.d.) c, VOy Evx1* neurons are spared in Pax7::Cre;Dbx1°™ mutant 
mice at E15.5 (wild type, N = 3; Pax7::Cre;Dbx1?™, N = 3; mean + s.d. 

a-c, Scale bars: 50 um. d, Ventral root recordings during drug-induced 
locomotor-like activity in E18.5 Pax7::Cre;Dbx1?™ preparations at different 
frequencies. e, Corresponding frequency-phase plots (N = 21). Error bars 
show angular dispersion. 
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Figure 4 | Deletion of excitatory VOy neurons causes hopping at high 
locomotor frequencies but preserves alternation at lower frequencies. 
a, VOy (Evx1*) neurons in E15.5 wild-type (WT; N = 3) and Vglut2::Cre; 
Dbx1?™ mutant (Mut.; N = 3) lumbar spinal cord (scale bar: 50 um; 
mean = s.d.). b, Locomotor pattern at medium to high frequencies and 
frequency-phase relationship of hindlimbs (bottom) in Vglut2::Cre;Dbx1 
mice. c, Phase histograms between the indicated pairs of limbs at low (2-4 Hz; 
three panels to the left) and high (4-10 Hz; three panels to the right) frequencies. 
d, Ventral root recordings during drug-induced locomotor-like activity in PO-P1 
Veglut2::Cre;Dbx1?™ preparations at different frequencies. e, Corresponding 
frequency-phase plots (N = 12). Error bars show angular dispersion. 
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that ipsilaterally projecting V2a excitatory neurons project to VOy 
neurons” and that V2a-ablated mice hop” at higher frequencies. These 
findings suggest that ensembles of neurons are selected as modules” at 
different speeds of locomotion. 

The Dbx1/Evx1 transcription factors seem to be phylogenetically 
conserved”’, and a modular system similar to the one described here 
may be found in other vertebrates with alternating gait. Moreover, 
during evolution bipedal and quadrupedal hopping have developed 
in a number of species and some strains of rodents, suggesting that this 
change in locomotor behaviour may be carried by a loss, partial or 
complete, of the V0-determined alternating system. 


METHODS SUMMARY 


All experiments were approved by the local ethical committee. For full details of 
methods used, including detailed information about mice lines, the in vivo and 
in vitro locomotor recordings and analysis, statistical analysis and protocols for 
immunohistochemical analysis, see Methods. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


All experiments were approved by the local ethical committee and performed in 
accordance with European guidelines for the care and use of laboratory animals. 
The following transgenic lines have been described previously: Pax7::Cre'’, 
E1Ngn2::Cre’’, Vglut2::Cre’’, Dbx [oe STOP-10x-DTA (by PTY Dy 1142 (ref. 9); 
Glyt2-GFP*'**; GAD67-GFP****, Hoxb8::Cre’’. All animals were genotyped after 
the experiments with gene-specific primers. 

In vivolocomotor analysis. All in vivo data were recorded using a TSE MotoRater 
system that captures the natural movement of animals on a runway. The animals 
were allowed to walk or run freely in a glass-walled corridor and recorded by high- 
speed camera at 100-200 frames per second. For each mouse, runs from at least 
three independent recordings were analysed using the Clickjoint software. The 
first and the last step in a run and episodes with frequencies lower than 2 strides per 
second were not included in the analysis. Phase values were calculated as the time 
of the swing onset of a limb related to the step cycle length of the reference limb in 
that particular step cycle. One step cycle corresponds to the time from swing onset 
until the next swing onset. 

In vitro experiments. Mice aged E18.5 (Pax7::Cre; Dbx1~°*) or newborn mice 
aged 1-2 days were used in all experiments. The embryonic state was called E0.5 
on the morning of plug detection. For analyses at E18.5, fetuses were surgically 
removed from pregnant mice, cooled, decapitated and eviscerated before the brainstem 
and spinal cord were isolated in ice-cold low calcium Ringer’s solution (oxygenated 
95% O2, 5% C02) that contained (in mM): 111 NaCl, 3 KCl, 11 glucose, 25 NaHCO3, 
3.7 MgSO, 1.1 KH2POy, 0.25 CaCl, (pH 7.4). Newborn animals were anaesthetized 
with isoflurane and dissected as described previously**. The isolated spinal cord was 
transferred to a recording chamber that was continuously perfused with normal 
Ringer’s solution and contained (in mM): 111 NaCl, 3 KCl, 11 glucose, 25 NaHCO3, 
1.25 MgSOg, 1.1 KH2PO,, 2.5 CaCl, oxygenated 95% O2, 5% CO; to obtain a pH of 
7A, All recordings were done at room temperature (22-24 °C). 

In vitro recording of locomotor activity and analysis. Locomotor activity was 
recorded with suction electrodes attached to the L2 and L5 lumbar roots on either 
side of the cord. The ventral root activity was band-pass filtered at 100 Hz to 1 kHz. 
The signal was sampled using Axoscope 10 (Molecular Devices) at 1-2 kHz. Data 
points for analysing cycle periods and phases were taken after the locomotor 
activity had stabilized 10-15 min after the initial burst of activity. Locomotor wave 
onsets were automatically detected and calculated based on the intersection between 
the smoothed traces with a horizontal line running through the centre of the trace. 
The cycle periods of the reference trace were normalized to a scale of 360 degrees, 
with the onset of a locomotor burst in the reference ventral root corresponding toa 
phase value of 0° and the onset of the next burst corresponding to a phase value of 
360°. The phase delay of the period cycle onset with respect to the reference was 
calculated for test traces. Epochs of at least 20 consecutive phase values were taken 
from a sample of at least 70 consecutive phase values. Flexor-extensor and left- 
right coordination were evaluated with circular statistics (see ref. 34), in which the 
vector direction gives the preferred mean phase of the activity, and the length of the 
vector (r) the precision of the phase. P values greater than 0.05 as determined by 
Rayleigh’s test were considered non-significant. The angular dispersions (a vari- 
ance measure) were calculated from r values and plotted as error bars**. Values for 
frequency-phase plots in the Veglut2::Cre;Dbx1?' and Pax7::Cre;Dbx1?™ mice 
were also generated using the Spinalcore software” (using Morlet wavelet algorithms) 
to obtain frequencies and phases. The condition for inclusion in these frequency- 
phase plots was that each of the samples (n) involves at least two significantly 
oriented vector relationships between activities in the ventral roots (LL2-RL2 and/ 
or LL5-RLS5 (left-right), and LL2-LL5 and/or RL2-RLS5 (flexor-extensor)). The 
Watson-Williams test for circular data was used to compare conditions”. 

The different locomotor frequencies for the in vitro experiments were obtained by 
varying the concentrations of locomotor-inducing drugs (5-HT in combination with 
NMDA)”. We classified frequency-phase data in groups according to their locomotor 
frequency: high frequency, (f) > 0.45 Hz; medium frequency, 0.45 = f= 0.2 Hz; and 
low frequency, f < 0.2 Hz. These arbitrary limits were set to include half and double 
the average locomotor frequencies obtained during drug-induced locomotor-like 
activity wild-type mice. 

Linear fits were produced to correlate changes in the phase to changes in the 
locomotor frequency (Supplementary Fig. 6). To make these calculations possible, 
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the circular data in Fig. 3e and Fig. 4e were converted to semi-circular coordinates 
assuming a central symmetry of data and bimodality for the two opposite states: 
alternation (0.5 turn) and synchrony (0.0 turn). Thus, concentration of the angle 
distribution within the 0.0-0.5 range forces the direction of angular variance to 
contribute to the linear fit’’. Having done this conversion, the weighted linear fits 
using means and semi-circular dispersion data were calculated by the least mean 
squares method on circular data (Supplementary Fig. 6). Significance of the corre- 
lation was additionally assessed with the Spearman’s rank correlation, r,, a test not 
requiring normal distribution of the two variables. 

Drugs. NMDA was used in combination with 5-HT to induce locomotor-like activity 
in vitro. GABA, and glycine receptors were blocked with picrotoxin (10 1M) and 
strychnine (0.5 1M), respectively. Sarcosine (100 11M) and nipecotic acid (120 1M) 
were used to block glycine and GABA uptakes, respectively. All drugs were pur- 
chased from Sigma. 

Immunohistochemistry. Whole embryos (E10.5) or isolated lumbar spinal cords 
(E15.5 to newborn) were fixed for 2 h in 4% (wg/vol) paraformaldehyde in phosphate- 
buffered saline (PBS), rinsed in PBS, cryoprotected in 30% (wg/vol) sucrose in PBS 
overnight, and embedded in OCT mounting medium. Transverse sections (16-,1m 
thick for E10.5/E12.5, or 20 j1m for E15 to postnatal day 1) were obtained on a 
cryostat. Sections were incubated overnight at 4°C with one or several of the 
following primary antibodies diluted in PBS supplemented with 5% (vol/vol) fetal 
bovine serum and 0.5% (wg/vol) Triton X-100: rabbit anti-Dbx1 (1:10,000), chicken 
anti-GFP (Aves Labs 1020; 1:2,000), chicken anti-B-Gal (abcam 9361; 1:1,000), 
guinea pig anti-Evx1 (1:4,000), goat anti-Pax3/7 (Santa Cruz 7748, 1:200), rabbit 
anti-Cre (1:8,000, gift from G. Shutz'’), guinea pig anti-Bhlhb5 (1:20,000, gift from 
B. Novitch*’), guinea pig or rabbit anti-Lbx1 (1:10,000, gift from C. Birchmeier”, 
rabbit anti-GABA (1:1,000, Sigma), sheep anti-Chx10 (1:500, Chemicon), and rabbit 
anti-Pax2 (1:300, Covance). Secondary antibodies were obtained from Jackson 
Immunoresearch or Invitrogen and incubated for 1h at room temperature. Slides 
were rinsed, mounted in Vectashield medium and scanned on a LSM5 confocal 
microscope (Zeiss Microsystems) using a X20 or a X40 objective. Multiple channels 
were scanned sequentially to prevent fluorescence bleedthrough and false-positive 
signals. A contrast enhancement and a noise reduction filter were applied in Adobe 
Photoshop for publication images. 

Cellular counts. For experiments at E10.5, five to ten non-adjacent cryo-sections, 
encompassing 300 to 600 jum of the hindlimb bud area, were counted. For experiments 
at E15.5-E16.5, five to ten non-adjacent sections spanning the rostral lumbar 
spinal cord (T13 to L3) were counted. Neurons expressing the makers of interest 
were counted on the same side, using raw z-stack confocal images obtained with 
the X40 objective and spanning the entire thickness of the sections. Counts were 
done manually with the help of the cell-counter plug-in in ImageJ. Cellular counts 
per section were averaged (or expressed as percentages) per individual animal, and 
the grand mean (or grand mean percentage) + standard deviation was calculated 
across animals to produce cells per hemi-section (or percentage) column plots 
summarizing anatomical staining in the figures. 
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AID stabilizes stem-cell phenotype by removing 
epigenetic memory of pluripotency genes 


Ritu Kumar!, Lauren DiMenna’, Nadine Schrode’, Ting-Chun Liu’, Philipp Franck!, Silvia Mufioz-Descalzo’, 
Anna-Katerina Hadjantonakis®, Ali A. Zarrin*, J ayanta Chaudhuri’, Olivier Elemento” & Todd Evans! 


The activation-induced cytidine deaminase (AID; also known as 
AICDA) enzyme is required for somatic hypermutation and class 
switch recombination at the immunoglobulin locus'. In germinal- 
centre B cells, AID is highly expressed, and has an inherent muta- 
tor activity that helps generate antibody diversity’. However, AID 
may also regulate gene expression epigenetically by directly dea- 
minating 5-methylcytosine in concert with base-excision repair to 
exchange cytosine’. This pathway promotes gene demethylation, 
thereby removing epigenetic memory. For example, AID promotes 
active demethylation of the genome in primordial germ cells*. 
However, different studies have suggested either a requirement* 
or a lack of function’ for AID in promoting pluripotency in soma- 
tic nuclei after fusion with embryonic stem cells. Here we tested 
directly whether AID regulates epigenetic memory by comparing 
the relative ability of cells lacking AID to reprogram from a differ- 
entiated murine cell type to an induced pluripotent stem cell. We 
show that Aid-null cells are transiently hyper-responsive to the 
reprogramming process. Although they initiate expression of plur- 
ipotency genes, they fail to stabilize in the pluripotent state. The 
genome of Aid-null cells remains hypermethylated in reprogram- 
ming cells, and hypermethylated genes associated with pluripo- 
tency fail to be stably upregulated, including many MYC target 
genes. Recent studies identified a late step of reprogramming 
associated with methylation status’, and implicated a secondary 
set of pluripotency network components*. AID regulates this late 
step, removing epigenetic memory to stabilize the pluripotent 
state. 

The path to pluripotency involves multiple steps that can be more or 
less efficient, and this can be modelled in transcription-factor-induced 
reprogramming’. Reprogrammed induced pluripotent stem cell (iPSC) 
clones initially retain methylation patterns that may reflect an epige- 
netic memory of the source lineage'*"’. To test directly the function of 
AID in reprogramming, we prepared tail-tip fibroblasts (TTFs) from 
Aid-knockout or control sibling mice. Before reprogramming, the fibro- 
blasts appeared morphologically identical (Supplementary Fig. 1a), and 
were transduced equivalently with lentivirus encoding the four human 
‘Yamanaka’ reprogramming transcription factors’*: OCT4, KLF4, SOX2 
and cMYC (OKSM; Supplementary Fig. 1b). Expression levels of exo- 
genous human OCT4 72 h after infection (before induction of the endo- 
genous murine gene) were equivalent, showing that lack of AID does not 
affect expression vector function (Supplementary Fig. 1c). Exogenous 
transcription factor expression was subsequently equivalently repressed 
in wild-type and Aid-null transduced cells (Supplementary Fig. 1d). 
Using quantitative polymerase chain reaction with reverse transcription 
(RT-qPCR) assays, transcript levels for Aid were not reliably detected 
above background in wild-type TTFs. However, after 1 week of repro- 
gramming, transcripts were readily measured in wild-type cells, increasing 
as much as tenfold during the reprogramming process (Supplementary 
Fig. 2). 


Initial reprogramming steps include the induction of proliferation 
and a change in the morphology of fibroblasts to smaller and rounder 
cells'®. We found that cells lacking AID are initially hyper-responsive 
to reprogramming factors. The change in morphology was more rapid in 
Aid ‘~ cells, beginning at 2 days after transduction. After 4 days, Aid /~ 
cells were rounded and smaller than Aid*’* cells (Fig. 1a). A higher 
fraction of Aid~’~ cells stained positive for SSEA1, an early marker 
for pluripotency™ (Fig. 1b). At day 7, more Aid-null cells expressed 
NANOG compared with controls (Fig. 1c), correlating with modestly 
higher transcript levels for several pluripotency genes before the first 
week of reprogramming (Fig. 1d). Consistent with the molecular data, at 
2 weeks the Aid-null cells consistently generated more early colonies 
than wild-type cells (sixfold more colonies on average over six independ- 
ent experiments). Thus, Aid-null fibroblasts are hyper-responsive to 
reprogramming, even though the growth curves for uninfected wild- 
type and Aid-null cells were indistinguishable (Supplementary Fig. 3a). 
Rather, the enhanced expression of pluripotency markers in Aid /~ 
cells suggests that AID normally helps to stabilize the differentiated 
state, creating a barrier to the initial process of reprogramming. When 
the Aid ’~ fibroblasts were passaged before transduction, this hyper- 
responsiveness was no longer seen (Supplementary Fig. 3b), suggesting 
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Figure 1 | Cells lacking AID are initially hyper-responsive to transcription- 
factor-based reprogramming. a, Aid‘’" and Aid ‘~ fibroblasts after 4 days of 
OKSM transduction. Note the more rounded appearance of null cells. Original 
magnification, X10. b, Cells positive for SSEA1 after 4 days of OKSM 
transduction, determined by flow cytometry. c, Immunostaining with anti- 
OCT4 and anti-NANOG antibodies after 1 week of OKSM transduction. 
Original magnification, x40. d, Relative transcript levels of pluripotency genes 
determined by qPCR after 4 days of OKSM transduction. For each gene, 
transcript levels were normalized to Aid*’* cells set to a value of 1. Data 
represent the mean + standard error of the mean (s.e.m.) from three 
independent experiments (*P < 0.05, **P < 0.01). 
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that passive removal of epigenetic marks through DNA replication can 
normalize the initial response to OKSM. 

Although there were more cells, no obvious differences were 
observed at 2 weeks in the morphologies of iPSC-like colonies derived 
from Aid ‘~ cells compared with Aid*’* cells, and both stained pos- 
itive for the pluripotency marker NANOG (Supplementary Fig. 4). 
However, by 4 weeks the colonies derived from Aid ‘~ fibroblasts 
were flattened with less defined edges (Fig. 2a). We tracked individual 
iPSC-like colonies and observed many Aid ‘~ colonies that appeared 
pluripotent at 3 weeks but showed a differentiated morphology at 4 
weeks (Fig. 2b). At 3 weeks, the Aid ’~ colonies showed a ‘patchy’ 
NANOG pattern, and by 4 weeks most colonies were differentiated, 
with few NANOG' cells (Fig. 2c). This highly reproducible phenom- 
enon was never observed for colonies derived from Aid*!* cells, which 
retained their iPSC morphology and NANOG expression throughout 
the 4 weeks (Fig. 2b, c). The colonies derived from Aid ‘~ cells showed 
a progressive decline in the frequency of cells expressing SSEA1 
(Fig. 2d) and OCT4 (Fig. 2e). The same results were observed using 
mouse embryonic fibroblasts (MEFs), as the Aid-null cells failed to 
maintain a pluripotent iPSC phenotype (Supplementary Fig. 5). The 
stabilization of pluripotency was equally effective in wild-type cells 
(and ineffective in Aid-null cells) regardless of whether constitutively 
expressed or doxycycline-inducible OSKM cassettes were used (Sup- 
plementary Fig. 6a). 

Using either wild-type or Aid-null cells, the expression of OKSM 
using a doxycycline-inducible vector for only 6 days was insufficient to 
generate pluripotent colonies. OKSM expression for 9, 12, 21 or 28 
days was sufficient to generate pluripotent colonies, showing a ‘dose 
response’ in the wild-type fibroblasts, but was equally ineffective for 
generating stable pluripotent cells from Aid-null cells at the 4-week 
time point (Supplementary Fig. 6b, c). When AID was expressed along 
with OKSM during initial reprogramming stages, this failed to rescue 
pluripotency (data not shown). However, when OKSM-transduced Aid- 
null cells were secondarily infected with an AID-expressing retrovirus 
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Figure 2 | AID stabilizes pluripotency. a, Aid ‘~ cells lose ES-cell-like 
morphological characteristics 4 weeks after OKSM transduction. b, Aid~/~ 
iPSC-like colonies progressively lose this phenotype. wk, weeks. c, Mutant cells 
lose NANOG expression. a-c, Original magnification, X10. d, e, Cells that 
stained positive for SSEA1 (d) or OCT4 (e) were measured by flow cytometry 
after 1, 2, 3 or 4 weeks of OKSM transduction; n = 3 independent experiments, 
error bars denote standard deviation. NS, not significant. 
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after 1 week (Supplementary Fig. 7a, b), there was a partial but statist- 
ically significant rescue of pluripotency markers stabilized at the 4-week 
time point. Because the rescue was not complete we cannot rule out 
additional molecular mechanisms, beyond AID, that also contribute to 
the stabilization of pluripotency. Expression of AID in wild-type cells 
further enhanced the stable expression of pluripotency markers at 4 
weeks (Supplementary Fig. 7c). As Nanog is syntenic with the Aid locus, 
an independent set of experiments used fibroblasts derived from an Aid- 
knockout strain in which the neoR gene was deleted'*. These ‘clean’ 
knockout cells also failed to derive stable iPSC colonies (data not shown). 

It was previously reported that iPSCs derived from different cell 
types retain an epigenetic memory of their somatic phenotype'®”’. 
We tested whether genome replication through passaging could sta- 
bilize a pluripotent phenotype even in the absence of AID. We gener- 
ated iPSC-like colonies and isolated 13 Aid*'* and 12 Aid ‘~ colonies 
(clones) at 3 weeks that appeared pluripotent by morphology. After 
three passages (p3) all the clones, regardless of genotype, retained iPSC- 
like morphology and stained positive for OCT4 and NANOG (Fig. 3a). 
However, between p7 and p10, 5 out of 12 Aid ‘~ clones failed to retain 
pluripotency and differentiated (Fig. 3b). All 13 Aid*’* colonies and the 
other 7 Aid ’~ clones that were stable beyond p10 retained pluripo- 
tency until at least p50. All iPSC clones that retained morphology 
beyond p10 formed embryoid bodies equivalently and could differenti- 
ate into cells expressing smooth muscle actin (mesoderm), BI tubulin 
(ectoderm) and «-fetoprotein (endoderm), regardless of Aid genotype 
(Fig. 3c). In an independent set of experiments, wild-type and Aid-null 
clones were picked and either maintained in culture as colonies or 
passaged. The non-passaged wild-type-derived clones maintained a 
pluripotent morphology, whereas the Aid-null cells differentiated. 
With passaging, 60% of the Aid-null clones were able to maintain a 
pluripotent morphology (Supplementary Fig. 8). 

Thus, although AID is not essential for reprogramming, an import- 
ant transition occurs at around 3 weeks that is assisted by AID to 
stabilize the pluripotent phenotype. We considered whether the previ- 
ously described DNA-demethylating role of AID might be at least 
partially responsible for this phenotypic stabilization. We profiled 
the epigenome by carrying out reduced-representation bisulphite 
sequencing (RRBS) in Aid-null and control cells after 3 weeks of 
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Figure 3 | Cells lacking AID reprogram inefficiently, but those that do are 
pluripotent. a, b, Immunostaining of iPSC colonies isolated 3 weeks after 
OKSM transduction and passaged three times (a) or seven times (b) before 
staining. In this example, the null cells failed to from stable iPSCs. ¢, Stable iPSC 
clones were passaged more than ten times. In this case (unlike in b) the 
passaged Aid-null cells maintained NANOG expression (data not shown) and 
generate embryoid bodies (EBs) that differentiate into cells positive for smooth 
muscle actin (SMA), BIT tubulin (BIII TUB) and «-fetoprotein (AFP). Original 
magnification, X10. 
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reprogramming. We observed global DNA hypermethylation in the 
Aid-null cells (Supplementary Fig. 9a), when we considered either total 
CpGs or differentially methylated regions (DMRs). DNA hypermethy- 
lation occurs preferentially near RGYW motifs (Supplementary Fig. 9b), 
the characteristic DNA-targeting sites for AID (45% of the hypermethy- 
lated CpGs). Notably, as AID targets gene bodies, the hypermethylated 
regions are enriched in gene bodies, even though RRBS is biased towards 
capturing CpG-rich promoter and enhancer regions (Supplementary 
Fig. 9c). 

Transcript profiles comparing the original, pre-transduced fibro- 
blasts from Aid*'* and Aid '~ embryos closely match (Supplemen- 
tary Fig. 10a). By contrast, many genes fail to be upregulated in the 
mutant cells during reprogramming (Fig. 4a, b and Supplementary 
Fig. 10b). The genes that fail to be upregulated during reprogramming 
are highly enriched (P< 1X10 '°) in the gene set showing hyper- 
methylation in Aid-null cells (Fig. 4c). Of note, genes that are upregu- 
lated during reprogramming in both wild-type and Aid-null cells are 
also enriched in hypermethylated genes (P< 1X 10 '“), suggesting 
that many genes overcome the loss of AID during reprogramming. 
Among the set of hypermethylated genes, in Aid-null cells there is a 
failure to upregulate a set of secondary pluripotency genes, including 
Rex1 (also known as Zfp42), Gdf3, Dnmt3l, Cbx7, Zfp296, Dnmt1, 
Apobecl and Tet3 (Fig. 4d). This expression data correlates well with 
the RBBS data, which was validated by MassArray bisulphite sequen- 
cing (Supplementary Fig. 11), and is consistent with a failure of this 
downstream network to be demethylated at late stages of reprogram- 
ming. Comparing the gene set of hypermethylated underexpressed 
genes to public embryonic stem (ES)-cell chromatin immunoprecipi- 
tation and massively parallel DNA sequencing (ChIP-seq) data sets 
revealed a marked enrichment of cMYC target genes in reprogram- 
ming cells lacking AID (Supplementary Fig. 12). According to qPCR 
data there was no difference in cMyc transcript levels in wild-type or 
Aid-null cells at any point in the reprogramming process (data not 
shown). Furthermore, enhanced levels of cCMYC, achieved by retroviral 
transduction starting at 1 week of reprogramming, failed to stabilize 
pluripotency in Aid-null cells (Supplementary Fig. 13), suggesting that 
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Figure 4 | Cells lacking AID fail to activate expression of hypermethylated 
pluripotency genes. a, In the original, pre-transduced fibroblasts few 
differentially expressed genes are off the diagonal. FPKM, fragments per 
kilobase of transcript per million fragments mapped; WT, wild type. b, Many 
genes are underexpressed in Aid-null cells 3 weeks after transduction. c, Fold 
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cMYC access to key target pluripotency genes (due to hypermethyla- 
tion) rather than cMYC levels per se may be a limiting step for stabili- 
zation of the network. 

Individual iPSC-like clones were isolated at 2 weeks and expanded 
to evaluate global methylation patterns for each individual clone. 
Hypermethylation patterns in the Aid-null clones were highly consist- 
ent with data obtained from bulk colony analysis. On the basis of 
genome-wide DNA methylation profiles (with the genome binned 
for 100 kb regions), Aid-null or wild-type clones cluster according to 
genotype (Supplementary Fig. 14a). Regions that were hypermethy- 
lated in the Aid-null bulk analysis tended to be hypermethylated in the 
Aid-null clones (Supplementary Fig. 14b). In fact, 66% of bulk hyper- 
methylated DMRs were also hypermethylated in DNA derived from 
the isolated clones (P<1X10 '*'). Methylation differences found 
previously for the secondary pluripotency genes were largely validated 
by MassArray bisulphite sequencing of DNA (Supplementary Fig. 
14c), and most of the hypermethylated genes were expressed in the 
Aid-null clones at significantly lower levels compared with levels in 
wild-type clones (Supplementary Fig. 14d). 

Altogether, our results show that active demethylation through 
AID-dependent cytosine deamination is important for stabilizing a 
genetic network controlling stem-cell phenotype. AID is not essential 
for reprogramming, because cells lacking AID can form stable plur- 
ipotent iPSCs, either through a passive demethylation process facili- 
tated by DNA replication, or perhaps through compensation from 
related members of the APOBEC family'®. A recent study suggested 
that loss of AID affects transcription-factor-induced reprogramming 
only by an early acute short hairpin RNA (shRNA)-dependent deple- 
tion’’, but did not report the ‘iPSC’ phenotype beyond 3 weeks, when 
we show that Aid-null cells eventually fail to stabilize as pluripotent. 
Unless passaged, Aid-null cells always fail to reprogram. RNA-seq 
profiles of Aid-null cells after 4 weeks of reprogramming cluster with 
fibroblast samples (data not shown), and qPCR analysis validates sig- 
nificant upregulation of fibroblast-associated genes in the Aid-null sam- 
ples (Supplementary Fig. 15), consistent with an epigenetic memory for 
fibroblast fate that fails to be fully removed during reprogramming in 
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enrichment for hypermethylated genes in Aid-null cells. All P values were 
calculated using the hypergeometric distribution. d, Transcript levels for 
hypermethylated pluripotency genes during reprogramming, comparing wild- 
type cells (blue lines) with Aid-null cells (red lines). Data represent the mean + 
s.e.m. from three independent experiments. 
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the absence of AID. The function of AID for transition to a stable 
pluripotent phenotype may be relevant during embryogenesis, as Aid- 
knockout strains have small litters. We found that ES-cell lines could 
be established from Aid-null blastocysts, and these appeared by mor- 
phology and staining patterns to be normal (Supplementary Fig. 16). 
However, the efficiency of ES-cell-line derivation was markedly reduced 
(17/28 embryos, 60.7%) compared with congenic wild-type embryos 
(21/24, 87.5%; P < 0.03 by chi-squared analysis). 

Although there is much promise for the use of iPSCs for disease 
modelling and cellular therapies"’, there remains concern about whether 
iPSC genomes are damaged through the process of reprogramming”’. 
AID, which has natural mutator activity, is activated during reprogram- 
ming, and mediates demethylation in gene bodies including secondary 
pluripotency genes that encode proto-oncogenes. Although inefficient, 
we showed that iPSCs could be generated in the absence of AID, remov- 
ing epigenetic memory marks through an AID-independent mech- 
anism. Retaining epigenetic memory, in the absence of AID, might be 
useful for promoting efficiency of differentiation towards parental lin- 
eage fate. Furthermore, if AID-independent reprogramming lessens the 
mutation load, it could provide a safer strategy for the generation of 
iPSCs for cellular therapies. 


METHODS SUMMARY 


TTFs were prepared using 0.2% collagenase; MEFs were prepared from embryonic 
day (E)13.5 embryos. Intracellular immunostaining was performed after fixing the 
cells with 4% paraformaldehyde. Surface staining was done without fixing the cells. 
Complementary DNA was prepared using Superscript III and qPCR was per- 
formed (primers listed in Supplementary Table 1) using SYBR green and a 
Roche LightCycler 480 II. All data were from at least three independent experi- 
ments. Enhanced RRBS was performed as described” and bisulphite reads were 
aligned to the bisulphite-converted mm9 genome using Bismark”’. We achieved 
very high coverage by sequencing one full Illumina lane (~200 M, 51 bp reads) per 
sample. For bulk colonies, the wild-type and Aid-null samples (at 3 weeks) were 
analysed with RRBS that covered 1.96 M and 1.82 M CpGs, respectively, with at 
least 10X coverage (with >1.62M CpGs in common between the two runs). 
1.52M and 1.59M CpGs had =20X coverage in wild-type and Aid ’~ cells, 
respectively. 117M and 1.32M CpGs had=50X coverage, respectively. 
Differentially methylated CpGs were identified using the Fisher exact test with 
Benjamini-Hochberg correction for multiple testing. DMRs were defined as con- 
taining at least five differentially methylated CpGs and a total methylation differ- 
ence of more than 10%. Paired-end RNA-seq libraries were constructed as 
described” and sequenced using an Illumina HiSeq2000. Reads were aligned to 
mm9 using TopHat and gene expression levels were quantified using Cufflinks, 
using upper-quartile and GC-content normalizations. Fold changes of 1.5 with 
fragments per kilobase of transcript per million fragments mapped (FPKM) >5 in 
at least one condition were used to derive differentially expressed genes. 
MassArray EpiTYPER analysis was performed on selected regions identi- 
fied as hypermethylated by RRBS. PCR primers (Supplementary Table 2) were 
designed to probe amplicons using EpiDesigner (http://www.epidesigner.com/). 
Bisulphite conversion and MassArray analysis were performed as previously 
described”. Metadata for all RRBS experiments are given in Supplementary 
Table 3. Methylation differences were calculated at each CpG and boxplot analysis 
was performed at each interrogated region. All statistical analyses were performed 
using the R software package. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


Summary. Intracellular immunostaining was performed after fixing the cells with 
4% paraformaldehyde. Surface staining was done without fixing the cells. All data 
were from at least three independent experiments. Enhanced RRBS was performed 
as described” and bisulphite reads were aligned to the bisulphite-converted mm9 
genome using Bismark”. We achieved very high coverage by sequencing one full 
Illumina lane (~200 M, 51 bp reads) per sample. For bulk colonies, the wild-type 
and Aid-null samples (at 3 weeks) were analysed with RRBS that covered 1.96 M 
and 1.82 M CpGs, respectively, with at least 10 coverage (with >1.62 M CpGs in 
common between the two runs). 1.52 M and 1.59M CpGs had =20X coverage 
in wild-type and Aid~’~ cells, respectively. 1.17 M and 1.32 M CpGs had =50X 
coverage, respectively. Differentially methylated CpGs were identified using the 
Fisher exact test with Benjamini-Hochberg correction for multiple testing. DMRs 
were defined as containing at least five differentially methylated CpGs and a total 
methylation difference of more than 10%. Paired-end RNA-seq libraries were 
constructed as described” and sequenced using an Illumina HiSeq2000. Mass- 
Array EpiTYPER analysis was performed on selected regions identified as hyper- 
methylated by RRBS. PCR primers (Supplementary Table 2) were designed to 
probe amplicons using EpiDesigner (http://www.epidesigner.com/). Bisulphite 
conversion and MassArray analysis were performed as previously described”. 
Metadata for all RRBS experiments are given in Supplementary Table 3. 
Methylation differences were calculated at each CpG and boxplot analysis was 
performed at each interrogated region. All statistical analyses were performed 
using the R software package. 

Mice. AID-deficient (Aid ‘~) mice were a gift from T. Honjo. Some of the repro- 
gramming experiments were performed on the Aid ‘~ and Aid*'* littermates 
derived by crossing Aid*’~ mice, and for some of the experiments wild-type 
BALB/c mice were purchased from the Jackson Laboratory. Reprogramming 
experiments were performed on tail fibroblasts obtained from seven Aid~/~ and 
seven Aid‘’* age-matched mice (four littermates and three non-littermates, 
each). We observed similar results when comparing littermates or the purchased 
mice. Mice or embryos used for MEFs were genotyped by PCR. All animals were 
maintained according to the guidelines for animal welfare of the Memorial Sloan- 
Kettering Research Animal Resource Center. 

Preparation of fibroblasts. For preparation of tail fibroblasts either whole tails or 
tail tips were collected from age-matched adult Aid*/* and Aid~/~ mice (age 
range, 3-8 weeks). Tails or tail tips were minced using a razor blade/scalpel after 
washing with PBS. For preparation of MEFs, Aid*/~ males and Aid*/~ females 
were time mated and E13.5 embryos were collected. Embryos were washed several 
times and extra care was taken to prevent maternal cross contamination. After 
removing the head and all internal organs embryos were minced using a razor 
blade/scalpel. Minced tails or embryos were washed twice with DMEM (Cellgro) 
containing 10% FBS (Gemini Bio Products), 1mM t-glutamine (Cellgro), 
100 Uml * penicillin (Cellgro) and 10 1g ml * streptomycin (Cellgro), and disso- 
ciated using 0.2% collagenase from Clostridium histolyticum (Sigma) at 37 °C for 4h 
(tails) or 2h (embryos) with continuous shaking. Dissociated cells were filtered 
through a 70-1m cell strainer (BD falcon) and washed two times with DMEM 
containing 10% FBS and plated on 0.1% gelatin- (Cellgro) coated tissue culture 
plates. Fibroblasts were cultured in DMEM containing 10% FBS, 1mM 
L-glutamine, 100 U ml’ penicillin and 100 1g ml! streptomycin and media was 
replaced daily. After passaging two times, fibroblasts were frozen in media contain- 
ing 10% DMSO, 40% FBS and 50% DMEM. Generally, p2 fibroblasts were used for 
the reprogramming experiments, unless indicated otherwise. Genotype was con- 
firmed by PCR. Primers for wild type, F: 5’°-GGTCCCAGTCTGAGATGTA; R: 5’- 
CAACGTGGCGTCCAAACAGGC; for knockout, F: 5'-CTGCCAAACCTGA 
TGTCTTGA,; R: 5’-AACCAAGCCTATGCCTACAGC. 

Production of lentiviruses and retroviruses. Reprogramming virus was pro- 
duced by transfection of a single lentiviral stem cell cassette (STEMCCA) in 
HEK293T cells using polyethylenimine (PEI) (Polysciences). STEMCCA (con- 
stitutive) expresses all four human factors (OCT4, KLF4, SOX2, cMYC), using a 
single elongation factor 1x (EFla) promoter within a lentiviral vector using a 
combination of 2A peptide and internal ribosome entry site (IRES) strategies. 
Both constitutive and doxycycline-inducible STEMCCA vectors were used for 
the reprogramming studies. For the doxycycline-inducible system, fibroblasts 
were treated with doxycycline (1 1g ml~') starting 1 day after transduction for 
2 weeks. Lentiviruses were produced using a five-plasmid transfection system with 
STEMCCA transfected together with four expression vectors encoding the pack- 
aging proteins Gag-Pol, Rev and Tat, and the G protein of the vesicular stomatitis 
virus (VSV). Five vectors were incubated in DMEM with 0.06% PEI for 20 min and 
then transferred to the plates containing 80% confluent HEK293T cells. 
Transfection was performed in FBS (10%) containing media. After 5h media 
was replaced with fresh media. Viral supernatant was collected after 48 h and spun 
at 3,000 r.p.m. to remove dead cells and filtered through a sterile syringe filter with 
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0.45 tum polyethersulfone membrane (VWR international). Ectopic expression of 
wild-type AID or catalytically mutant AID was achieved using retroviral vectors as 
described previously”*. Retroviruses were produced by co-transfecting the vector 
with the packaging plasmid pCL-ECO in HEK293T cells. The CMYC overexpres- 
sion experiments were achieved using retroviral vector pMXs-c-Myc (Addgene, 
13375). pMXs-c-Myc was co-transfected with vector encoding VSVg in GP2-293 
cells to collect functional virus. 

Transduction with lentiviruses. Five-hundred-thousand fibroblasts were plated 
in one well of a gelatin-coated 6-well tissue culture plate. After 6h, 1 ml of viral 
supernatant containing 8 1g ml! polybrene (Millipore) was added to the fibro- 
blast-containing wells. For every experiment, Aid’ and Aid‘’* fibroblasts were 
infected at the same time with an identical titre of virus, prepared for each experi- 
ment from the same batch. Viral supernatant was removed after 12 h of infection 
and fresh media containing DMEM, 10% FBS, 1 mM t-glutamine, 100Um!"! 
penicillin and 10 jg ml’ streptomycin was added. After 1 day media was replaced 
by mouse embryonic stem-cell (MES) media containing DMEM, 20% ES-cell- 
compatible FBS (Gemini Bio Products), LIF (2% conditioned medium) and 
1.510 *M monothioglycerol (MTG; Sigma), 1 mM 1-glutamine, 100 U ml? 
penicillin and 10 jig ml streptomycin. Cells were maintained in the same media. 
In some of the experiments cells were transferred to mitotically inactivated MEF 
feeders after 2 days of transduction. Approximately 30,000-50,000 cells were 
transferred to one well of a feeder-containing 6-well plate. Isolated iPSC colonies 
were always cultured on feeders. All the cells were kept at 37°C in a humidified 
environment at 5% CO>. 

Immunostaining. Immunostaining was performed on fixed cells (4% PFA in BBS 
with 1mM CaCl, 15min) washed and blocked for 30 min in BBT-BSA buffer 
(BBS with 0.5% BSA, 0.1% Triton and 1 mM CaC],). Cells with primary antibodies 
were incubated overnight at 4°C at the following dilutions: anti-NANOG 
(eBiosciences 14-5761, 1:100, 5 ug ml’) and anti-OCT4 (Santa Cruz 5279, 
1:100, 2 ug ml~ 1). Cells were washed and blocked in BBT-BSA and then incubated 
with Alexa-conjugated secondary antibodies (1:500, from Molecular Probes). 
Vectashield-DAPI was used as a mounting medium. Images were acquired using 
a Zeiss LSM 510-Meta confocal microscope or a Zeiss epifluorescence microscope 
with AxioVision software. For flow cytometric analysis cells were trypsinized, 
fixed with 4% PFA for 20 min, blocked for 1h and then stained in suspension. 
SSEA1 (Santa Cruz, 1:100) staining was performed on unfixed cells. Cells were 
analysed on a BD-Accuri C6 flow cytometer (BD Biosciences) using CFlow Plus 
software. 

RT-qPCR. Cells were trypsinized and collected in Trizol reagent (Life technolo- 
gies). Total RNA extraction was done using the RNeasy Mini Kit (Qiagen). The 
cDNA was synthesized using the SuperScript III First-Strand Synthesis System 
(Invitrogen). RT-qPCR was performed on three experimental replicates using 
SYBR green Master I (Roche). Data were generated on a LightCycler 480 II 
(Roche) and analysed using LightCycler 480 software. qPCR data was calculated 
based on the mean of three experimental replicates. All quantifications were 
normalized to an endogenous Gapdh control. The relative quantification value 
for each target gene was compared to the calibrator for that target gene. All the 
primers used for qPCR spanned introns (Supplementary Table 1). 

In vitro differentiation. For embryoid body formation, iPSCs were passaged two 
times on feeder-free gelatin-coated culture dishes. 5 X 10° iPSCs were plated on a 
10-cm low attachment dish. Embryoid bodies were cultured for 6 days in MES 
media without LIF, and media was changed every other day. For mesodermal 
differentiation, embryoid bodies were replated on gelatin-coated dishes and cul- 
tured for another 7 days in the same media. For neural differentiation, embryoid 
bodies were cultured in media containing DMEM F-12 (Cellgro), 0.5% N2 (Gibco) 
and 0.5% B27 (Gibco) supplements, 1 mM t-glutamine, 1% non-essential amino 
acids (Gibco) and 1.5 X 10 * M monothioglycerol for another 3 days, after which 
embryoid bodies were seeded onto gelatin-coated dishes with the same media, plus 
10M retinoic acid (Sigma) for another 4 days. For endoderm differentiation, 
embryoid bodies were replated onto gelatin-coated dishes in MES media lacking 
LIF but containing 0.5% FBS and 50ngml | Activin (R&D) for 7 days. For all 
differentiation cultures, media was changed daily. 

ES-cell derivation from blastcyst-stage mouse embryos. Aid ’~ males and 
Aid ’~ females (or congenic wild-type pairs as controls) were time mated and 
E3.5 blastocyst-staged embryos were collected in M2 media according to standard 
protocols’®. Embryos were placed on mitotically inactivated MEFs” in knockout 
DMEM (Gibco) containing 15% knockout serum replacement (KSR; Gibco), 2 mM 
L-glutamine, 0.1 mM B-mercaptoethanol, 0.1 mM non-essential amino acids, 1 mM 
sodium pyruvate, 100 U ml penicillin and 100 1g ml streptomycin, 1,000 U ml! 
LIF, 1 pM PD0325901 (ERK inhibitor) and 3 11M CHIR99021 (inhibitor of GSK3B). 
After embryos attached, media was replaced every other day. After 10-11 days an 
outgrowth could be observed and was dissociated in 0.25% trypsin/EDTA using a 
mouth-controlled drawn glass pipette. Trypsin was inactivated by adding knockout 
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DMEM (Gibco) containing 15% ES-cell-compatible FBS, 2 mM L-glutamine, 0.1 mM 
B-mercaptoethanol, 0.1mM non-essential amino acids, 1mM sodium pyruvate, 
100Uml? penicillin and 100 pg ml! streptomycin, 1,000U ml! LIF, 1 uM 
PD0325901 and 3 1M CHIR99021. From this point onwards the culture was main- 
tained in the same media on mitotically inactivated MEFs. Media was replaced every 
other day. After 3-5 days ES-cell-like colonies were observed and passaged at 70% 
confluency. Stable ES-cell lines were validated by passaging a minimum of ten times 
without loss of pluripotent morphology or marker expression. 

Genomic analysis. RRBS was performed as described” and bisulphite reads were 
aligned to the bisulphite-converted mm9 using Bismark”’. Differentially methylated 
CpGs were identified using the Fisher exact test with Benjamini-Hochberg correc- 
tion for multiple testing. We defined DMRs as regions containing at least five 
differentially methylated CpGs, where contiguous differentially methylated CpGs 
are separated by 250 bp or less, and for which the total methylation change between 
wild-type and Aid-null cells is 10% or more (calculated using all CpGs within the 
considered region including those that were not called as differentially methylated). 
The distribution of DMR lengths was as follows. For hypermethylated DMRs, 
average, 222 bp; median, 190 bp; minimum, 10 bp; maximum, 4,081 bp. For the 
lower number of hypomethylated DMRs, we observed the following statistics: 
average, 229 bp; median, 188 bp; minimum, 19bp; maximum, 1,250 bp. Paired- 
end RNA-seq libraries were constructed as previously described** and sequenced 
using an Illumina HiSeq2000. Reads were aligned to mm9 using TopHat and gene 


expression levels were quantified using Cufflinks, using upper-quartile and GC- 
content normalizations. Twofold changes with FPKM > 5 in at least one condition 
were used to derive differentially expressed genes. All statistical analyses were 
performed using the R software package. The meta-data for all RBBS data are 
provided in Supplementary Table 3. 

Quantitative DNA methylation analysis by mass spectrometry. The level of 
DNA methylation for specific genes was measured using a MALDI-TOF mass 
spectrometry based method (EpiTYPER; Sequenom) as previously described”. 
Briefly, 1 ug of DNA was treated with sodium bisulphite using the EZ methylation 
kit (Zymo-Research). The treatment converts non-methylated cytosines into ura- 
cil, leaving methylated cytosines unchanged. PCR amplification, addition of SAP 
solution and Transcription/RNase A cocktails were performed according to the 
protocol provided by Sequenom and the mass spectra were quantified by the 
EpiTYPER analyser. Amplicons probed are given in Supplementary Table 2. 
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A stable transcription factor complex nucleated by 
oligomeric AMLI-ETO controls leukaemogenesis 


Xiao-Jian Sun’, Zhanxin Wang’, Lan Wang*"*, Yanwen Jiang”, Nils Kost®, T. David Soong’, Wei-Yi Chen, Zhanyun Tang’, 
Tomoyoshi Nakadai’, Olivier Elemento’, Wolfgang Fischle®, Ari Melnick°, Dinshaw J. Patel’, Stephen D. Nimer?"* 


& Robert G. Roeder! 


Transcription factors are frequently altered in leukaemia through 
chromosomal translocation, mutation or aberrant expression’. 
AML1-ETO, a fusion protein generated by the t(8;21) translocation 
in acute myeloid leukaemia, is a transcription factor implicated in 
both gene repression and activation’. AML1-ETO oligomerization, 
mediated by the NHR2 domain, is critical for leukaemogenesis**, 
making it important to identify co-regulatory factors that ‘read’ the 
NHR2 oligomerization and contribute to leukaemogenesis*. Here 
we show that, in human leukaemic cells; AML1-ETO resides in 
and functions through a stable AML1-ETO-containing transcrip- 
tion factor complex (AETFC) that contains several haematopoietic 
transcription (co)factors. These AETFC components stabilize the 
complex through multivalent interactions, provide multiple DNA- 
binding domains for diverse target genes, co-localize genome wide, 
cooperatively regulate gene expression, and contribute to leukae- 
mogenesis. Within the AETFC complex, AML1-ETO oligomeriza- 
tion is required for a specific interaction between the oligomerized 
NHR2 domain and a novel NHR2-binding (N2B) motif in E pro- 
teins. Crystallographic analysis of the NHR2-N2B complex reveals a 
unique interaction pattern in which an N2B peptide makes direct 
contact with side chains of two NHR2 domains as a dimer, providing 
a novel model of how dimeric/oligomeric transcription factors create 
a new protein-binding interface through dimerization/oligomeriza- 
tion. Intriguingly, disruption of this interaction by point mutations 
abrogates AMLI-ETO-induced haematopoietic stem/progenitor 
cell self-renewal and leukaemogenesis. These results reveal new 
mechanisms of action of AML1-ETO, and provide a potential thera- 
peutic target in t(8;21)-positive acute myeloid leukaemia. 
AMLI-ETO consists of the DNA-binding (RUNT) domain of the 
haematopoietic transcription factor AML1 (also known as RUNX1) and 
four conserved domains (NHR1-4) of ETO (also known as RUNX1T1)’. 
These domains differentially contribute to AML1-ETO activities in 
regulating cell proliferation, differentiation and survival’. In particular, 
the NHR2-mediated oligomerization of AML1-ETO has been shown 
to be critical for leukaemogenesis**. Although oligomerization endows 
AMLI1-ETO with a DNA-binding preference for duplicated AML1 
sites’, itis important to explore the possibility that oligomerization might 
also affect cofactor recruitment and function. AML1-ETO is generally 
thought to act as a transcriptional repressor by recruiting corepressors 
(for example, NCOR and HDAC proteins) to AML] target genes*”°, 
or by interacting and interfering with other transcription factors (for 
example, ETS family proteins, C/EBPa, GATA1 and E proteins)''”. In 
relation to its functions in gene activation, AML1-ETO also can recruit 
the coactivators p300 (ref. 18) and PRMT1 (ref. 19). Beyond these indi- 
cations of dynamic AML1-ETO interactions with diverse proteins, it has 
been unclear whether AMLI-ETO resides in any stable multiprotein 


complex(es) that might endow it with new properties that lead to altered 
regulatory events and corresponding cellular functions. 

To identify a natural AML1-ETO-containing complex in leukaemic 
cells, we used patient-derived Kasumi-1 cells and an antigen-purified 
anti-ETO antibody that showed high specificity and affinity (Sup- 
plementary Fig. 2a, b). The absence of wild-type ETO in Kasumi-1 
cells” allowed selective isolation of AML1-ETO from derived nuclear 
extracts, which contained most of the AML1-ETO (Supplementary 
Fig. 2c). Using a high-stringency buffer to preclude weak or non- 
specific interactions, we isolated the stable AETFC complex, whose 
components were identified by SDS-polyacrylamide gel electrophor- 
esis (SDS-PAGE) (Fig. la) and mass spectrometry (Supplementary 
Fig. 3a), and confirmed by immunoblot (Fig. 1b). These components 
include the AML1-binding partner CBF-B, E proteins HEB and E2A, 
the haematopoietic E-box-binding transcription factor LYL1 (but not 
its homologue SCL; also known as TAL1), the LIM-domain protein 
LMO2and its interacting partner LDB1. Although interactions between 
some of these factors (or their homologues) have been implicated in a 
related GATAI-SCL-E2A-LMO2-LDB1 complex in erythrocytes”, 
their connection with AML1-ETO in AMLis unknown. A gel-filtration 
analysis indicated that they form a stable, high-molecular-weight com- 
plex (Supplementary Fig. 3b). We then used baculovirus vectors to 
reconstitute AETFC and to characterize the pairwise interactions 
within AETFC (Supplementary Fig. 3c, d). The results revealed an 
interaction network (Fig. 1c) in which several strong interactions link 
all the components one by one and probably have a major role in 
AETFC assembly, although some weak interactions may further stabi- 
lize the complex. 

To assess potentially joint functions of the AETFC components, we 
performed a chromatin immunoprecipitation coupled with massively 
parallel DNA sequencing (ChIP-seq) analysis of AMLI-ETO, HEB, 
E2A and LMO2 in Kasumi-1 cells and validated some binding events 
by ChIP coupled with quantitative polymerase chain reaction (ChIP- 
qPCR)—for example, on the well-established CSF1R FIRE” and SPI 
URE” enhancers. Comparison of their binding sites and enrichment 
scores indicated a genome-wide co-localization and a correlation of 
binding strengths, and suggested that the interactions between the 
components facilitate their binding to the genome (Fig. 1d and 
Supplementary Fig. 4a—d). Sequence analysis of AML1-ETO-binding 
regions revealed that both AMLI sites and E-boxes (recognized by E 
proteins and other basic helix-loop-helix (bHLH) transcription factors 
including LYL1) were overrepresented (Supplementary Fig. 4e, f), 
suggesting that, in addition to direct binding to DNA through the 
RUNT domain, AMLI-ETO also binds to DNA indirectly through 
its interaction with the E-box-binding AETFC components. 

To investigate the role of AETFC in regulating gene expression and 
leukaemogenesis, we performed gene knockdown in Kasumi-1 cells” 
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Figure 1 | AML1-ETO resides in and functions through AETFC. a, SDS- 
PAGE and silver staining of AETFC isolated from Kasumi-1 nuclear extract. 
Asterisks, non-specific bands. rlgG, rabbit immunoglobulin G. b, Co- 
immunoprecipitation and immunoblot confirmation of AETFC components. 
Asterisk, IgG signal. c, Schematic of interactions within AETFC. Thick and thin 
lines denote strong and weak interactions, respectively. Double spheres denote 


and in a mouse leukaemic model induced by AML1-ETO9a (AE9a), 
a leukaemogenic, truncated form of AML1-ETO™”. First of all, indi- 
vidual knockdowns of AETFC components (but not SCL) significantly 
decreased some other components at the protein level but not at the 
messenger RNA level (Supplementary Fig. 5a, b), suggesting a mutual 
stabilization mechanism within AETFC. Interestingly, SCL mRNA was 
downregulated by knockdown of any AETFC component (Supplemen- 
tary Fig. 5b). Because the 3’ haematopoietic enhancer of SCL” is bound 
by AETFC (Supplementary Fig. 5c), SCL is probably a direct target gene 
of AETFC. In an extension of this observation, global ChIP-seq and 
RNA-seq analyses revealed that the genes up- and downregulated by 
one AETFC component were similarly regulated by others (Fig. le and 
Supplementary Fig. 5d). These analyses led to the identification of a 
set of genes that are both directly bound and cooperatively regulated by 
AETFC components (Supplementary Fig. 5e and Supplementary 
Tables 1, 2). We next showed that knockdowns of AETFC components 
considerably delayed leukaemogenesis in mice (Fig. 1f and Supplemen- 
tary Fig. 6), indicating a requirement for AETFC components in 
AMLI1-ETO-mediated leukaemogenesis. Notably, double knockdown 
of HEB and E2A most markedly delayed leukaemogenesis, which is 
consistent with the primary importance of these two E proteins in 
AETFC assembly/stabilization (Fig. 1c and Supplementary Fig. 5a, f). 
To investigate further the function and mechanism of these two E 
proteins in AETFC, we performed co-immunoprecipitation experi- 
ments with a series of deletion mutants of AML1I-ETO and HEB, as 
well as a GST pull-down assay with isolated domains. These analyses 
clearly established multivalent interactions between AML1-ETO and 
E proteins and, interestingly, showed that the functionally critical 
NHR2 domain mediates not only oligomerization but also an inter- 
action with E proteins (Supplementary Fig. 7; ref. 16). We next took 
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potential homodimerization of components. d, ChIP-seq and ChIP-qPCR 
analyses of AETFC components on CSF1R and SPI. e, Correlation between AE 
and LDB1 in regulating gene expression. shCtrl, control shRNA. f, Knockdown 
of AETFC components delays AE9a-induced leukaemogenesis in mice, 
indicated by representative bioluminescent imaging (top) and survival curves 
(bottom; n = 10; ***P < 0.001; **P < 0.01). 


several steps to clarify the relationship between NHR2 oligomerization 
and the NHR2-E-protein interaction. First, we found that the multisite 
NHR2 mutation m7, which disrupts NHR2 oligomerization’, also 
completely disrupts the NHR2-E-protein interaction (Fig. 2a, b). 
Second, we performed an exhaustive screening of NHR2 mutations 
and found that NHR2 oligomerization and the NHR2-E-protein 
interaction involve different surfaces of the NHR2 a-helix (Fig. 2a—d). 
Accordingly, we identified point mutations that specifically disrupt 
the NHR2-E-protein interaction (Fig. 2b) but not NHR2 oligomeriza- 
tion, as indicated by co-immunoprecipitation (Supplementary Fig. 8) 
and gel-filtration (Fig. 2c) assays. Third, we found that the mutant- 
wild-type AML1-ETO heterodimer, unlike the wild-type homodimer, 
failed to bind HEB(AAD1) (Fig. 2e), indicating that a single wild-type 
AML1-ETO in the dimer is unable to bind HEB(AAD1) and, thus, that 
NHR2 oligomerization is required for the NHR2-E-protein interaction. 

To clarify further the mechanism of E-protein recognition of the 
NHR2 oligomer, we first mapped the N2B region in HEB. As a result, 
we identified a novel, 18-amino-acid N2B motif that is required for the 
NHR2 interaction (Fig. 3a, b and Supplementary Fig. 9). A subsequent 
X-ray crystallographic analysis of the NHR2-N2B complex showed 
that NHR2 formed a tetrameric o-helical bundle* with four N2B pep- 
tides bound on two symmetrically related surfaces with 1:1 stoichi- 
ometry (Fig. 3c and Supplementary Fig. 10a). After an independent 
validation of the directionality and register of the bound N2B peptide 
(Supplementary Fig. 10b, c), we demonstrated that a given N2B binds 
to a narrow surface channel created by two NHR2 helices, with P191 
and $192 of N2B inserted deeply into a surface pocket (Fig. 3d). 
Notably, P191 of N2B interacts through hydrogen bonding with 
$522 from one NHR2 helix, while $192 and V194 form hydrogen 
bonds with E501 and H504, respectively, from another NHR2 helix 
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mutant was eluted as a monomer. d, Distinct surfaces of the NHR2 o-helix 
mediate oligomerization and the E-protein interaction. e, NHR2 
oligomerization is required for the NHR2-E-protein interaction. Mutant- 
mutant-AE homodimer (lanes 1, 4), mutant-wild-type heterodimer (lanes 2, 5) 
and wild-type—wild-type homodimer (lanes 3, 6) were formed in 293T cells by 
co-expression of wild-type AE and NHR2-mutated AE (XX) that fails to 
interact with HEB(AAD1), and their abilities to bind to HEB(AAD1) were 
determined by co-immunoprecipitation. IP, immunoprecipitant. 


Figure 3 | Structural details of the 
NHR2-N2B interaction. a, Alignment of 
the N2B regions of vertebrate E proteins. 
Arrows denote residues mutated to alanine in 
mutants aml-am8. Gg, chicken; Xl, frog. 

b, Identification of the N2B motif by 
co-immunoprecipitation. c, Side (top) and 
look-down (bottom) ribbon views of the 
structure of the complex between the 
N2B(177-200) peptide and the 
tetrameric-helical bundle of NHR2. The N2B 
peptide is shown in stick representation, and 
the NHR2 helices in ribbon representation. 
d, Positioning of the N2B peptide on the 
electrostatic surface of the NHR2 tetramer. 

e, One N2B peptide interacts with two 
NHR2 helices through hydrogen bonds. 

f, g, Fluorescence-polarization analysis of the 
binding between NHR2 and a shorter 
N2B(182-196) peptide, comparing wild-type 
(WT) and indicated mutants of NHR2 (f) and 
N2B (g). n = 3; mean + standard deviation 
(s.d.). 
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(Fig. 3e). This unique interaction pattern confirms the results of the 
cell-based assay (Fig. 2e) by showing, unequivocally, that two dimer- 
ized NHR2 helices are required for the N2B interaction. To verify the 
binding specificity further, we made mutations in NHR2 and in a 
shorter N2B(182-196) peptide, and determined their interactions 
using fluorescence polarization assays. The results showed that the 
binding of such a short N2B peptide to NHR2 is relatively weak but 
highly specific, as the mutation in NHR2 (mNHR2; Supplementary 
Fig. 11a) completely abolished the binding (Fig. 3f). Reciprocally, single 
mutations of the key residues P187, P191 or $192 of N2B also signifi- 
cantly reduced the binding, whereas mutation of the less important 
P188 residue had very little effect (Fig. 3g). 

To study the biological relevance of the AMLI-ETO-E-protein 
interactions in leukaemogenesis, we introduced the mNHR2 mutation 
that specifically disrupts the NHR2-N2B interaction without affecting 
NHR2 oligomerization (Figs 2c, 3f and Supplementary Fig. 11a, b), as 
well as the mNHR1 mutation that blocks the NHR1-AD1 interaction”. 
Effects of the NHR1 and NHR2 mutations on AML1-ETO function 
were initially analysed with the human CD34" haematopoietic stem/ 
progenitor cell (HSPC) self-renewal assay**. Although AML1-ETO and 
the mNHR1, mNHR2 and mNHR1+2 mutants exhibited comparable 
expression levels in transduced CD34* cells (Supplementary Fig. 11c), 
mNHR2 significantly impaired the ability of AMLI-ETO to maintain 
CD34" cells (Fig. 4a). We also measured the frequencies of functional 
HSPCs using cobblestone-area-forming cell (CAFC) and long-term 
culture-initiating cell (LT'C-IC) assays. Notably, mNHR2 significantly 
reduced the CAFC and LTC-IC colonies in both assays, whereas 
mNHRI showed relatively mild effects and no clear synergy between 
mNHRI and mNHR2 was evident (Fig. 4b, c). These results suggest that 
the NHR2-N2B interaction, rather than simply acting additively with 
the NHR1-AD1 interaction to increase binding affinity, may act as a 
conformational switch within AETFC (Supplementary Fig. 1a) and 
thus contribute to AML1-ETO-enhanced HSPC self-renewal. By con- 
trast, these mutations showed no effect on AMLI1-ETO activities in 
differentiation inhibition (Supplementary Fig. 12) and cell growth 
arrest (data not shown). To exclude the possibility that mNHR2 may 
affect interactions other than the E-protein interaction, we designed 
(based on the structure) another NHR2 mutation (mNHR2S) that 
effected a similar loss of E-protein binding but differed completely in 
sequence from mNHR2. Nevertheless, mNHR2 and mNHR2S showed 
similar effects in HSPC self-renewal and differentiation (Supplemen- 
tary Fig. 13), indicating that these effects were specifically caused by 
disruption of the NHR2-N2B interaction. 

To determine whether these interactions are also required for leu- 
kaemogenesis in vivo, we used the mouse leukaemic model” involving 
transplantation of mouse HSPCs transduced with wild-type and 
mutated AE9a. In this assay, mice carrying AE9a-mNHR2 showed a 
significant delay in leukaemogenesis compared with AE9a mice. First, 
AE9a-mNHR2 mice showed a significantly lower white-blood-cell 
count than AE9a mice (Fig. 4d), consistent with peripheral blood 
morphology (Supplementary Fig. 14a). Second, mNHR2 mice had a 
lower percentage of c-Kit"/Mac-1/GFP* leukaemic blast cells in 
their peripheral blood (Supplementary Fig. 14b, c). Third, undiffer- 
entiated leukaemic cells were enriched in AE9a mice but not in 
mNHR2 mice (Fig. 4e). Last, mNHR2 mice survived significantly 
longer than AE9a mice (Fig. 4f). The mNHR1 mutation again showed 
relatively mild effects in suppressing leukaemogenesis in mice. Taken 
together, both human HSPC and mouse transplantation assays clearly 
showed that the NHR2-N2B interaction is required for AML1-ETO- 
mediated leukaemogenesis. Given that the extremely high stability of 
the NHR2 oligomer makes its therapeutic targeting very challenging’, 
the NHR2-N2B interaction potentially offers a new target. 

In contrast to previously reported dynamic interactions of various 
proteins with AML1-ETO under overexpression conditions, this study 
provides a direct and unbiased analysis of natural AML1-ETO in leuk- 
aemic cells, and identifies an endogenous stable complex (AETFC). The 
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Figure 4 | Role of the AML1-ETO-E-protein interactions in human HSPC 
self-renewal and mouse leukaemogenesis. a, Effects of AE and derived 
mutants in the maintenance of human CD34" cells. b, CAFC assays measuring 
the frequency of functional HSPCs in cells transduced with AE or derived 
mutants. c, LTC-IC assays measuring the frequency of functional HSPCs after 
long-term in vitro culture of cells transduced with AE or derived mutants. 
a-—c, nm = 3; mean = s.d. d, Effects of AE9a and derived mutants in mouse 
leukaemogenesis, as indicated by white blood cell (WBC) count of peripheral 
blood at 4 months after transplantation. e, Morphological analysis of mouse 
bone marrow cells. Original magnification, < 1,000. f, Kaplan-Meier survival 
curves of the mice (n = 15). 


multiple components in AETFC offer opportunities for the recruitment 
of AMLI-ETO to a variety of target genes and for the regulation of 
gene expression through context-dependent functional interactions with 
diverse transcriptional coactivators and corepressors (Supplementary 
Figs 1, 15; further discussion in Supplementary Information). In leuk- 
aemic cells, most, if not all, AML1-ETO probably resides in and functions 
through AETFC, because (1) knockdown of other AETFC components 
destabilizes AMLI-ETO, (2) many AML1-ETO-regulated genes are 
likewise co-occupied and co-regulated by AETFC components, and (3) 
knockdowns of AETFC components significantly impair leukaemogen- 
esis. Consistent with the observation that HEB and E2A have essential 
roles in AETFC assembly and stabilization, and in leukaemogenesis, the 
direct AML1-ETO-E-protein interaction is shown to be required for 
leukaemogenesis. Among the two defined AML1-ETO-E-protein inter- 
action sites, surprisingly, the NHR2-N2B interaction, which requires 
oligomerization of NHR2, is necessary and sufficient for AML1-ETO- 
mediated leukaemogenesis (further discussed in Supplementary Infor- 
mation). Oligomerization of transcription factors has been known to be 
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an important regulatory mechanism for modulating their DNA-binding 
specificity and affinity; and a choice of different oligomerizing partners 
also contributes to the selective recruitment of cofactors”**°. The unique 
NHR2-N2B interaction provides a novel model in which oligomeric 
transcription factors use their oligomerization domains to create a 
new protein-binding interface. Thus, this study reveals new mechanisms 
of action of oligomerized AML1-ETO in leukaemogenesis, and provides 
potential therapeutic possibilities. 


METHODS SUMMARY 


Kasumi-1 cells were grown in 12-litre spinner flasks (BellCo) and the nuclear 
extract was prepared with our standard high-salt extraction method. Complex 
purification was performed in a high stringency buffer (containing 300 mM KCl 
and 0.1% NP-40). The Bac-to-Bac Baculovirus Expression Systems (Invitrogen) 
was used to reconstitute the complex in Sf9 cells. ChIP was performed with a dual 
cross-linking method, and ChIP- and RNA-seq were performed following 
Illumina protocols. The short hairpin RNA (shRNA) targeting the AMLI-ETO 
fusion site was designed as previously described”; and other lentiviral shRNA sets 
were purchased from Open Biosystems. The co-immunoprecipitation assay was 
performed in a buffer containing 300 mM NaCl and 1% Triton X-100. Purified 
N2B peptide and NHR2 domain were mixed at a 1:1 ratio and crystals of the 
complex were grown by the hanging-drop method. Human HSPC self-renewal 
and differentiation assays'***, and mouse leukaemogenesis assays”, were per- 
formed as previously described. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


Protein purification and mass spectrometry. Kasumi-1 cells were grown in 12- 
litre spinner flasks (BellCo) and the nuclear extract was prepared by our standard 
high-salt extraction method’!. A custom anti-ETO antibody (Covance) was deve- 
loped against a C-terminal peptide of ETO and purified with a recombinant 
protein. Complex purification was performed in buffer BC300 (20 mM Tris, pH 
7.3, 300mM KCI, 0.2mM EDTA and 20% glycerol) plus 0.1% NP-40. Eluted 
proteins were separated by SDS-PAGE and visible bands on the gel were sliced 
separately and subjected to liquid chromatography coupled with tandem mass 
spectrometry (LC-MS/MS) analysis. For immunoblots, the following antibodies 
were used: ETO (Santa Cruz; sc-9737), HEB (Santa Cruz; sc-357), E2A (Santa 
Cruz; sc-763), LDB1 (Santa Cruz; sc-11198), LYL1 (Abcam; ab30334), CBF-B 
(Pierce; PA1-317), LMO2 (Abcam; ab81988) and SCL (a gift from R. Baer). 

Gel filtration. The antigen-eluted complex was subjected to gel filtration with a 
Precision Column PC 3.2/30 that was pre-packed with Superose 6 and connected to 
an AKTA FPLC system (GE Healthcare). A Superdex 75 column (GE Healthcare) 
was used to determine the oligomeric states of wild-type and mutant AML1-ETO 
NHR2. 

Reconstitution of protein complex. The Bac-to-Bac Baculovirus Expression 
Systems (Invitrogen) was used to generate recombinant baculoviruses containing 
each AETFC component. Reconstituted protein complexes were purified from Sf9 
insect cells co-infected with different baculoviruses. 

ChIP and ChIP-seq. Antigen-purified anti-ETO antibody and antibodies against 
HEB (Santa Cruz; sc-357), E2A (Santa Cruz; sc-763) and LMO2 (Abcam; ab81988) 
were used for ChIP and ChIP-seq assays. ChIP was performed with a dual cross- 
linking method. ChIP-seq libraries were constructed following the Illumina pro- 
tocol and sequenced on the Illumina Genome Analyzer II. Peak-calling/annotation 
and consensus sequence analyses were performed with ChIPseeger™, CisFinder* 
and Cisgenome™. 

shRNA knockdown. The shRNA against AML1-ETO was designed to target the 
fusion site**. Other lentiviral shRNA sets were purchased from Open Biosystems. 
Virus preparation and cell infection were performed according to the manufac- 
turer’s protocol. Knockdown efficiencies were analysed by RT-qPCR and/or 
immunoblot at 4 days after infection. 

RNA-seq. Kasumi-1 cells with knockdowns of AETFC components were har- 
vested at 4 days after infection and RNA was extracted with TRIzol Reagent 
(Invitrogen). Libraries were generated using the TruSeq RNA Sample 
Preparation Kit (Illumina) and sequenced with Illumina HiSeq2000. The reads 
were aligned with TopHat and then processed with an in-house program to 
calculate the reads per kilobase per million reads (RPKM) value for each gene 
following published methods”. 

Cell transfection and co-immunoprecipitation. 293T cells were transfected 
using Lipofactamine 2000 (Invitrogen). Cell lysis and co-immunoprecipitation 
were performed with T/G lysis buffer (20 mM Tris-HCl, pH 7.5, 300 mM NaCl, 
50 mM NaF, 2mM EDTA, 1% Triton X-100 and 20% glycerol), and proteins were 
analysed by immunoblot. 

GST pull-down. GST-tagged proteins were expressed in Escherichia coli and 
purified with Glutathione Sepharose 4B (GE Healthcare). The *°S-labelled prey 
proteins were synthesized with the TNT Quick Coupled Transcription/Trans- 
lation System (Promega). Pull-down assays were performed with NETN buffer 
(20 mM Tris-HCl, pH 8.0, 100 mM NaCl, 1 mM EDTA and 0.5% NP-40), followed 
by SDS-PAGE and autoradiography analyses. 

Co-expression of N2B fragment and NHR2 domain. The His-Sumo-tagged 
N2B(100-219) fragment of HEB and the GST-tagged NHR2(482-551) domain 
of AML1-ETO were co-expressed in E. coli and purified by GST or His pull-down. 
By serial truncation of N2B from both ends, a short N2B(177-200) peptide was 


shown to retain its binding affinity for NHR2, and this N2B fragment was used for 
crystallization trials. 

Crystallization and structure determination. Purified N2B(177-200) peptide 
and NHR2(486-548) domain were mixed at around 1:1 ratio and subjected to 
crystallization trials. Crystals of the complex were grown by the hanging-drop 
method at 20°C in buffer containing 0.1 mM Tris (pH 8.5) and 20% ethanol. A 
2.9 A resolution data set of the complex was collected on beamline 24ID-E at the 
Advanced Photon Source, Argonne National Laboratory. The initial model was 
solved by molecular replacement using the free NHR2 structure as the search 
model. The validation of the directionality and register of the bound N2B peptide 
were performed with the selenomethionine-labelling method following the intro- 
duction of V186M, V194M and corresponding double mutations. 
Fluorescence-polarization-based measurements. Peptides were labelled with 
fluorescein at the N terminus. Fluorescence-polarization assays were carried out 
and analysed using FP buffer (10mM TEA, 20mM NaCl, pH 7.4)”*. Titration 
series of protein in 10 pl volume and containing 100 nM fluorescinated peptide in 
384-well plates were read multiple times on a Plate Chameleon II plate reader 
(HIDEX Oy). 

Human HSPC self-renewal and differentiation. CD34" cells were purified from 
human cord blood samples using the MACS CD34 Isolation Kit (Miltenyi Biotec). 
After expansion, cells were infected with MIGRI1-derived retroviruses. GEP*/ 
CD34* cells were sorted and grown in basic or differentiation medium and re- 
plated weekly. Numbers of HSPC and differentiated cells were determined by cell 
counting and flow cytometry. For the CAFC assays, CD34” cells were co-cultured 
onan MS-5 monolayer and demi-depopulated weekly. The cobblestone areas were 
scored at week 5. For LTC-IC assays, the cells were plated at week 6 in methylcel- 
lulose with cytokines, and the colony numbers were scored 14 days after seeding'***. 
Mouse fetal liver transplantation and leukaemogenesis assays. Mouse fetal liver 
HSPCs were infected with MIGRI1-derived retroviruses and cultured in X-VIVO 
medium (Lonza) with cytokines. Recipient mice were lethally irradiated and trans- 
duced fetal liver cells were transplanted into recipient mice by tail-vein injection. 
Leukaemia development was determined by complete blood count, flow cytome- 
try and morphological analyses of peripheral blood and bone marrow cells”. The 
overall survival of the mice was analysed by the Kaplan-Meier method, and the 
statistical significance was evaluated by the logrank test. For bioluminescent 
imaging of leukaemia in mice*’, AE9a leukaemic cells were transduced with 
MSCV-Luciferase PGK-hygo (Addgene). On injection of the substrate of lucifer- 
ase, D-Luciferin, bioluminescent imaging was performed using an IVIS100 
imaging system. For gene knockdown assays in the mouse model, a mouse leuk- 
aemic cell line was generated from the spleen of the mice bearing AE9a-induced 
leukaemia. These cells were infected with shRNA lentiviruses, selected by puro- 
mycin and injected into recipient mice to generate leukaemia. 
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Protein folding is often described as a search process, in which poly- 
peptides explore different conformations to find their native struc- 
ture. Molecular chaperones are known to improve folding yields by 
suppressing aggregation between polypeptides before this conforma- 
tional search starts'”, as well as by rescuing misfolds after it ends’”. 
Although chaperones have long been speculated to also affect the 
conformational search itself—by reshaping the underlying folding 
landscape along the folding trajectory*°—direct experimental evid- 
ence has been scarce so far. In Escherichia coli, the general chaperone 
trigger factor®* (TF) could play such a role. TF has been shown to 
interact with nascent chains at the ribosome””’, with polypeptides 
released from the ribosome into the cytosol", and with fully folded 
proteins before their assembly into larger complexes’”. To investigate 
the effect of TF from E. coli on the conformational search of poly- 
peptides to their native state, we investigated individual maltose 
binding protein (MBP) molecules using optical tweezers. Here we 
show that TF binds folded structures smaller than one domain, which 
are then stable for seconds and ultimately convert to the native state. 
Moreover, TF stimulates native folding in constructs of repeated 
MBP domains. The results indicate that TF promotes correct folding 
by protecting partially folded states from distant interactions that 
produce stable misfolded states. As TF interacts with most newly 
synthesized proteins in E. coli, we expect these findings to be of 
general importance in understanding protein folding pathways. 


Atomic force microscopes and optical tweezers can be used to study 
protein folding at the single-molecule level’*"*. By pulling proteins at 
their ends while accurately monitoring lengths and forces, one can 
directly observe partially folded states that are only transiently stable, 
and follow transitions between them. This method has been used 
primarily to investigate isolated proteins’*’* or proteins that interact 
with metal ions’” or ligands'*”, but rarely to address how folding 
is affected by chaperones”. Here we investigate how the chaperone 
trigger factor (TF) influences the folding of two proteins: the single- 
domain MBP, which folds efficiently in isolation, and an MBP repeat 
construct, which has a high tendency to misfold. This approach made 
it possible to distinguish how TF affects the folding transitions within 
one domain and misfolding between domains. 

In the absence of any chaperone, stretching MBP molecules with 
optical tweezers (Fig. 1a and Supplementary Fig. 1) results in a simple 
unfolding pattern in the force-extension domain that involves two 
transitions” (Fig. 1b). When the force reached 10 pN on average, the 
measured total protein length had increased by 28 nm, indicating the 
unfolding of a polypeptide segment of that length. The remaining 
structure unfolded completely at around 22 pN, yielding the full 
MBP length of 120 nm. After relaxation to zero force, the polypeptide 
chain was allowed to refold for 5s. Subsequent stretching typically 
showed either a fully folded or fully unfolded state, as evidenced by 
a curve similar to the first or by a curve lacking unfolding features 
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Figure 1 | TF promotes partial protein folds. a, Schematic diagram of the set- 
up. MBP (green) is tethered between two beads: one (right) is held on a 
position-controlled micropipette; the other (left) is held by an optical trap that 
allows force detection. At the amino terminus, MBP is attached to a bead by a 
920-nm-long DNA tether (orange) via streptavidin—biotin linkages. The DNA 
is attached to the bead using antibody-digoxigenin linkages. At the carboxy 
terminus, MBP is attached to a bead via an antibody—myc-tag connection. 

b, Stretching-relaxation experiments in the absence of TF. The force-extension 
data shows a frequently visited intermediate state Ip in between the fully folded 
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state N and the unfolded state U. c, Stretching-relaxation experiments in the 
presence of TF (1 uM), indicating frequently visited additional stable states of 
intermediate protein length. d, Protein length distribution of stable states 
without (light blue) and with (dark blue) TF, as derived from the force- 
extension data. Lengths are determined by fitting to the worm-like chain 
model”. Error bars, + 1s.d. The native, Ip and fully extended states correspond 
to lengths of about 0 nm, 28 nm and 120 nm. The number of times a state with a 
certain length is visited is indicated as a fraction of the total number of visits to 
all states. See main text for meaning of asterisks. 
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(refolding rate ~0.7s '), in line with a two-state folding process”. 
Analysis of many stretching curves showed that a small fraction dis- 
played additional protein states that were visited briefly (for less than 
1s) during unfolding (9% of observed states, N = 215). In the distri- 
bution of protein lengths, these states are seen as four small but sig- 
nificant peaks between 40 and 110nm (indicated by asterisks in 
Fig. 1d), which are consistent with partially folded states observed 
previously during atomic force microscope unfolding of MBP and 
truncated MBP constructs*'. Thus, a number of partially folded MBP 
states can be transiently stable during unfolding in the absence of 
chaperone. 

In the presence of TF (1 uM, Methods) the first stretching curves 
were unchanged, but subsequent stretching and relaxation data were 
quite different (Fig. 1c). The intermediate protein lengths that were 
rare without chaperone were now observed frequently and for longer 
times (of the order of seconds). Consequently, the four additional 
peaks in the distribution of observed lengths were substantially higher 
(Fig. 1d, asterisks). The peaks were distinct, suggesting that in different 
experiments the protein chains repeatedly adopted similar structures. 
The broadness of some peaks, and the small unfolding transitions 
observed occasionally (Supplementary Fig. 2), did indicate small struc- 
tural variations. Overall, the increased peak heights indicated that TF 
promoted the presence of partially folded states. The correspondence 
in peak positions (Fig. 1d, asterisks) suggested that they contained 
elements of the MBP native state. 

To probe interactions within the TF-promoted partial folds, we 
considered how they were disrupted by force. Disruption occurred 
in discrete steps at high force, consistent with the unfolding of partial 
folds that exhibit tertiary structure. In contrast, if the polypeptide were 
sequestered on TF without tertiary structure, disruption would be 
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expected to occur gradually. We found that with TF the unfolding 
force of partial folds was almost twice that without (15 versus 27 pN 
on average, Fig. 2a). Some unfolding forces exceeded even 40 pN, 
which is significantly higher than the force at which the native state 
unfolds without chaperone (22 + 5 pN). These data indicate that TF 
either bound the partial folds directly, or that it simultaneously bound 
the two non-folded polypeptide segments directly flanking the folded 
structure. Stabilization by TF thus seems to have similarities with the 
stabilization of fully folded proteins by ligand binding’’. Unfolding of 
the partial folds could be triggered by (partial) TF dissociation. This 
scenario would be consistent with reported TF-substrate lifetimes” *, 
which are of similar order (between 0.1 and 100s) to that of the 
stretching experiments (several seconds). 

The protection against forced unfolding suggests that TF may also 
stabilize partial folds in the absence of force, and thus affect folding to 
the native state. To address this issue, we analysed folding transitions 
during the 5-s waiting periods at zero force in the presence of TF, by 
comparing protein lengths before and after this period (Fig. 2b-e). 
Sometimes, the protein length increased (14%, N = 33; Supplemen- 
tary Fig. 2) or remained constant (22%; Fig. 2d, experiment 2), indi- 
cating that the folded structure had unfolded or remained stable, 
respectively. Most often however (64%) the protein length decreased, 
indicating that a larger part of the protein chain had become folded. 
We observed the full range of these refolding transitions: from 
unfolded to an intermediate length (Fig. 2e, experiment 2), from an 
intermediate length to a shorter one (Fig. 2c, experiment 2), and froma 
shorter one onwards to the native length (Fig. 2c experiment 4 and 
Fig. 2b experiment 2). Given the 5-s waiting period, the average lifetime 
of TF-promoted partial folds is thus of the order of seconds”. These 
visits to partial folds thus significantly decrease the overall folding rate, 
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Figure 2 | TF stabilizes partial folds and mediates folding transitions. 

a, Distribution of forces maximally sustained by partial MBP folds (protein 
lengths between 40 and 110 nm). In the presence (dark blue) of TF the forces are 
higher than in its absence (light blue), indicating protection against forced 
unfolding. b-e, Transitions between folded states with TF. In the diagrams 
(top), circles represent protein states, with white the fully extended state and 
black the most compact state. Arrows represent transitions between these states 
during stretching (high force) or relaxation followed by waiting at zero force for 
5s (zero force). Numbers indicate the order of experiments. Graphs (bottom) 
show force-extension curves, with the colours corresponding to the arrows 
shown in diagrams above. At high force, proteins typically unfold but can also 
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remain intact (c, experiment 3). At zero force, the data show the full spectrum of 
folding transitions: from unfolded to intermediate (e, experiment 2), from 
intermediate to another intermediate (c, experiment 2), and from intermediate 
to native (c, experiment 4, and b, experiment 2), as evidenced by subsequent 
characteristic unfolding pattern (b, experiment 3). The protein length can also 
remain unchanged (d, experiment 2). For clarity, the stretching-relaxation 
behaviour of fully unfolded protein with DNA tether is indicated by a black line. 
f, Average probabilities of decreasing, maintaining or increasing the protein 
length in the stretching (high force) and relaxation (zero force) experiments 
with TF, starting from any intermediate MBP fold (b-e). 
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consistent with bulk studies'’’*”®. The partially folded states were not 
observed when pulling from zero force in the absence of TF, suggesting 
they were either visited too briefly during refolding at zero force, or did 
not form at all. In the absence of applied force, TF thus transiently 
stabilized partial folds, resulting in a step-wise refolding via these par- 
tial folds. To test the generality of these central findings, we performed 
a similar set of experiments on a different substrate, namely the protein 
luciferase, which gave comparable results (Supplementary Fig. 6). 
Two main lines of evidence indicate that TF-mediated folding leads 
to the native structure. First, we found MBP function is restored after 
chemical denaturing and refolding in the presence of TF in a bulk 
assay, as tested by amylose binding (Fig. 3a). Second, after unfolding 
by force and relaxation, subsequent stretching experiments displaying 
the characteristic two-step native unfolding pattern indicated that 
MBP had natively refolded (Fig. 2b experiment 3). Once MBP is folded 
natively, we find that it binds TF only weakly or not at all. This can be 
seen in the optical tweezers experiments, where the first stretching 
curve showed the same unfolding pattern and unfolding force—with 
or without TF. In bulk, a column retardation assay further confirmed 
this lack of interaction (Fig. 3c). Partial folds do interact with TF 
(Fig. 2a). To confirm this in a bulk assay, we purified MBP constructs 
that are truncated and thus cannot form the full MBP native state but 
can form partially folded structures (Fig. 3b). TF indeed bound these 
truncates in column retardation assays, in contrast to the full-length 
MBP (Fig. 3c). TF-truncate binding was also evidenced by a suppres- 
sion of heat-induced aggregation (Fig. 3d). Overall, the data confirm 
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Figure 3 | TF binds partially folded states, not the native state. a, Bulk 
refolding and functional assay of MBP. Indicated is the amount of protein 
specifically bound to amylose, a mimic of the natural MBP substrate maltose, 
after chemical denaturation and subsequent refolding without and with TF (see 
Methods). a.u., arbitrary units. b, Constructed MBP truncates 1-3 in purple and 
red, with red indicating their C and N termini (see Methods, Supplementary 
Figs 3 and 4). c, TF binding to MBP truncates. All three MBP truncates bind TF, 
natively folded MBP (wild type; WT) does not. See Methods. d, Thermal 
aggregation assay. Displayed is an Fourier transform infrared (FTIR) signal that 
indicates the amount of precipitated MBP truncate 3 (b) as a function of 
temperature, in the absence (red) and presence (orange) of TF (see Methods). 
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that TF binds partial folds along the folding pathway until it reaches 
the native state. 

We surmised that TF association could be particularly beneficial for 
complex multi-domain proteins, which can suffer from misfolding 
interactions between domains”. To test this hypothesis, we performed 
stretching experiments on a construct composed of four MBP repeats 
(MBP,). When unfolded MBP, was relaxed in the absence of chaper- 
one, subsequent stretching typically showed it had become as compact 
as native MBP, but it failed to unfold”® (Fig. 4a). This suggested that 
tight misfolding interactions had formed between domains. When 
unfolding did occur, the length increases were typically more than 
one full-length domain (370 residues). This large step-size indicated 
the disruption of interactions between domains, though less tight than 
observed without chaperone. Occasionally we observed signatures of 
native-like folds involving just one domain, as characterized by unfold- 
ing forces below 30 pN and length changes smaller than one MBP 
repeat. Overall, we found that the major part of the protein chain 
(73% on average, N= 41 stretching curves) was involved in tight 
(35%) or weak (38%) misfolding interactions, and the rest (27%) in 
native interactions (Fig. 4b). These data indicate a strong tendency for 
MBP, to misfold. 

In the presence of TF, the first stretching curve on fresh MBP, was 
similar to curves without TF. However, subsequent stretching of the 
same construct after relaxation showed marked differences (Fig. 4a). 
Strikingly, with TF present, although weak misfolding interactions 
were still substantial (35%), the tight misfolds that failed to unfold 
were only rarely observed (less than 4%). The majority of the protein 
chain was now involved in native unfolding signatures (61%, N = 55, 
Fig. 4b). As a result, over 10 unfolding-refolding cycles on one con- 
struct could sometimes be followed, contrasting with a maximum of 3 
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Figure 4 | TF suppresses misfolding between domains. a, Force-extension 
curves for a four MBP repeat construct, with and without TF (1 1M). See Fig. 2 
legend for description of diagram under. Without TF: after unfolding 
(experiment 1, orange), relaxation (2, blue), waiting at zero force for 5s, 
stretching (3, bright red) shows no unfolding, indicating tight misfolding 
interactions between domains. With TF: after unfolding (not shown) and 
relaxation (2, green), waiting at zero force, stretching (3, dark red) showed 
extension changes characteristic of native interactions (for example, at 

~1.23 tum) and weak non-native interactions (at ~1.10 um), but no tight ones. 
b, Misfolded and folded fractions with and without TF. Indicated are the 
fractions of the polypeptide length involved in native, weak and tight 
misfolding interactions (see main text). Data are derived from multiple force- 
distance curves (N = 41 without, N = 55 with TF). c, Bulk refolding and 
functional assay of the multi-domain protein. Indicated is the amount of 
protein specifically bound to amylose, after chemical denaturation and 
subsequent refolding without and with TF, and with the protein BSA (Methods, 
Supplementary Fig. 4). d, Cartoon of folding landscape reshaping by TF. Left, 
without chaperone, interactions between domains leads to efficient misfolding. 
Right, the stabilization of partial folds by TF can be seen as a deepening or 
formation of energy valleys. The barrier heights to misfolds are increased more 
than to native folds, in line with the promotion of local over distant interactions 
along the polypeptide. 
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cycles due to tight misfolding in the absence of TF. We further probed 
native refolding of MBP, in a bulk assay, which showed that in the 
absence of chaperone MBP, failed to refold to a functional state, 
whereas the presence of TF restored native folding, as shown by a more 
than tenfold increase in amylose binding ability (Fig. 4c, Methods). A 
control experiment in which bovine serum albumin (BSA) was used 
instead of TF again led to misfolding and an inability to bind amylose, 
indicating that the effect of TF is not caused by aspecific interactions. 
We note that although vectorial synthesis and co-translational folding 
may in principle also suppress misfolding between domains”, inter- 
actions with TF delay folding until nascent chain lengths are well 
beyond the average size of one domain”’. Thus, the propensity of TF 
to promote native folding within domains while suppressing long 
range interactions is of central importance to de novo protein folding. 

The results may be summarized by considering the underlying fold- 
ing energy landscape (Fig. 4d). By binding and protecting partial folds, 
TF induces energy valleys or deepens existing ones, and hence directs 
folding pathways along these states towards native folds and away 
from misfolds. In contrast, the chaperone SecB binds only non-folded 
polypeptide, in a manner that suppresses folding altogether’. The 
ability of TF to combine binding with folding could stem from its 
highly flexible structure and heterogeneous surface’. These features 
may allow TF to interact dynamically with substrates as they fold, 
either by remaining partially bound when folding transitions occur, 
or by unbinding fully. Interestingly, the data suggest a simple mech- 
anism by which TF resolves the generic chaperone challenge of dis- 
tinguishing native folding from misfolding: by interacting with a 
restricted region of the polypeptide, TF could effectively promote local 
native contacts by shielding this region from misfolding interactions 
with distant regions, hence allowing shielded domains to fold quasi- 
independently from the rest. Given the minimal requirements for this 
mechanism, it is intriguing to consider whether it is employed by other 
chaperone systems as well. 

Our study has shown that the conformational search of a polypep- 
tide for its native state can be influenced directly by a chaperone. The 
generic interaction between chaperones and proteins suggests that this 
chaperone-induced reshaping of the folding energy landscape is of 
general importance to our understanding of protein folding. 


METHODS SUMMARY 


Expression and purification of MBP and MBP, constructs, DNA linker and bead 
preparation, and the characteristics of the optical tweezers have been described 
previously*®. Expression and purification of TF is described in Methods. Bulk 
refolding of MBP, was assayed by chemical denaturing, followed by refolding in 
the presence or absence of TF, testing of the ability to bind amylose resin, and 
quantification of protein amounts by SDS-PAGE, Coomassie staining and densi- 
tometric analysis using the analysis software Image Gauge (Methods). To test the 
affinity of TF for MBP partial folds, N-terminally biotinylated MBP truncates 
(Supplementary Fig. 4) were engineered, purified (Methods), immobilized on 
Streptactin Sepharose and incubated with purified TF. Bound TF was eluted 
and detected by western blotting using TF-specific polyclonal antibodies 
(Methods). 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


Contact map with high-energy contact regions. The 3D conformation of MBP 
(PDB 1jw4) can be represented in a symmetrical, square, Boolean adjacency 
matrix (Ay) of pairwise, inter-residue contacts. Aj was constructed using a thresh- 
old of 7 A in spatial distance between the residue’s Co. atoms. Nearest neighbour 
contacts, with a distance d(Aj, Axi) = ((i- ky + (Gji- 17)" of less than 2, were 
considered to form a single contact region. The shortest range interactions, 
between residues spaced less than 20 residues along the polypeptide, were not 
considered. The stability of the contact regions was crudely estimated by summing 
empirical Thomas-Dill energies” of all the residue-residue contacts within that 
region. The residues involved in the 5 most stable contact regions are indicated in 
red in Supplementary Fig. 4. These contact residues define the ends of a sub- 
structure, as indicated in purple in Supplementary Fig. 4. 4 of the 5 sub-structures 
are nested, such that for instance one large sub-structure (Supplementary Fig. 4e) 
encompasses 3 other sub-structures (Supplementary Fig. 4b-d). One of the stable 
sub-structures is independent of the others (Supplementary Fig. 4a). One may 
consider that two independent sub-structures are present simultaneously, and 
either interact and thus form one larger fold (for example, Supplementary Fig. 4a 
and e, which together form the core state), or do not interact, giving two distinct 
folds. 

TF expression and purification. TF with a Ulpl-cleavable N-terminal His,- 
SUMO tag was expressed from vector pCA528-tig in E. coli BL21 DE3Atig::Kan 
by induction with 1 mM isopropyl-1-thio-B-p-galactopyranoside at 30°C. The 
protein was purified using Ni-IDA matrix (Protino, Macherey-Nagel); eluted 
material was supplemented with His,-Ulp1 protease, dialysed overnight, and 
contaminants were removed by incubation with Ni-IDA matrix and subsequent 
anion-exchange chromatography (Resource Q, GE Healthcare). 

Bulk MBP and MBP, refolding assay. 2.51.M MBP, and 104M MBP were 
denatured by incubation in RF buffer (50 mM HEPES-KOH, pH 7.5, 100 mM KCl, 
5mM MgCl.) containing 8 M urea at 37°C for 12h and subsequently for 5 min at 
60 °C. Samples were diluted 1:10 in RF buffer in the absence or presence of TF (2 [1M) 
ina final volume of 50 pl and incubated for 15 min at 30 °C. Forty microlitres amylose 
magnetic beads (NEB) equilibrated in RF buffer were added and samples were 
incubated at 4°C for 15 min. Resin was washed four times in 1 ml RF buffer and 
bound MBP or MBP, was eluted by resuspending the resin in 40 il RF buffer contain- 
ing 20mM maltose followed by centrifugation and collection of the supernatant. 
Samples were subjected to SDS-PAGE followed by silver staining (Supplementary 
Fig. 5). Protein bands corresponding to the MBP proteins were quantified using 
Image Gauge (Fujifilm Science Lab). 

Expression and purification of N-terminally biotinylated MBP truncates. 
N-terminally biotinylated MBP fragments were expressed as hybrid proteins con- 
sisting of an Ulp1-cleavable N-terminal His;)-SUMO tag followed by a TEV 
protease-cleavage site, an AviTag (Avidity, LCC), which is biotinylated in vivo, 
and the respective MBP fragment. Proteins were expressed in E. coli BL21 cells 
harbouring pBirAcm, an IPTG inducible plasmid with an extra copy of the birA 
gene encoding biotin ligase (Avidity, LCC) in LB medium supplemented with 


20 mg! ' biotin, 20 mg!‘ kanamycin, 10 mg! ' chloramphenicol, 0.2% glucose. 
Cells were lysed in denaturing buffer D containing 20mM Tris-HCl pH 7.5, 
0.2M NaCl, 8M urea, the lysate was incubated at room temperature for 1h, 
cleared from cell debris by centrifugation at 35,000g for 30 min and incubated 
with Ni-IDA matrix (Protino, Macherey-Nagel) for 30min. The matrix was 
washed extensively with buffer D followed by several washing steps in refolding 
buffer R (20 mM Tris-HCl pH 7.5, 0.2 M NaCl, 1 M urea) and finally buffer MBP 
(20 mM Tris-HCl pH 7.5, 0.2 M NaCl). Hybrid proteins were eluted in buffer MBP 
containing 500mM imidazole. The eluate was supplemented with His,-Ulp1 
protease, dialysed overnight in 20 mM Tris-HCl pH 7.5, 0.2 M NaCl, 10% glycerol, 
and remaining contaminants (Hisg- Ulp1 protease and His,9-SUMO) were removed 
by incubation with Ni-IDA matrix. 

Interaction studies of TF with MBP and MBP truncates. Saturating amounts of 
N-terminally biotinylated MBP or MBP truncates were immobilized by incuba- 
tion with 150 ul Streptactin Sepharose (IBA) for 15 min at 4°C in MBP buffer 
(20 mM Tris-HCl pH 7.5, 0.2 M NaCl). Unbound protein was removed by repet- 
itive washing steps using MBP buffer. TF interaction with immobilized MBP or 
MBP truncates was studied by a one-hour incubation with purified TF (5 1M) at 
room temperature followed by 5 times washing for 5 min in MBP buffer supple- 
mented with 0.1% NP40. Bound TF was eluted by boiling in SDS sample buffer 
(125 mM Tris-HCl pH 6.8, 3% SDS, 3% B-mercaptoethanol, 0.01% bromophenol 
blue, 10% glycerol) for 5 min. Detection of TF was done by standard western 
blotting procedure using TF-specific polyclonal antibodies from rabbit (custom 
made by Charles River Laboratories). 

FTIR measurements. An MBP truncate (residues 57-267, see Supplementary 
Fig. 4e) was purified as described above except that hybrid protein was directly 
digested while bound to the IDA matrix by overnight incubation with the Hisg- 
Ulp1 protease in buffer MBP (20 mM Tris-HCl pH7.5, 0.2 M NaCl). The eluate 
containing the biotinylated MBP truncate free of the SUMO-tag and the protease 
(both were His-tagged and remained bound to the affinity matrix) was concen- 
trated using Vivaspin concentrators (Sartorius) to a concentration of 2mg ml! 
and dialysed overnight at 4°C in 20mM Tris-HCl pH7.5, 0.2M NaCl, 10% 
glycerol. For FTIR measurements, purified TF and the MBP truncate were dialysed 
in separate dialysis bags in the same buffer container to exclude any buffer differ- 
ences. After dialysis, concentrations of TF and MBP fragment were determined 
using the Biorad Protein Assay and FTIR measurements were performed at 1:1 
stoichiometry at protein concentrations of approximately 30 14M. FTIR spectra 
were taken from 20 to 60°C using Bruker Tensor27 instrumentation equipped 
with BioATRCell II. For each spectrum, 60 scans were conducted over a range 
from 3,000 to 1,000 cm !. Between each series of scans, temperature was raised by 
1°C followed by an equilibration time of 2 min. Spectra were corrected by sub- 
traction of the background water spectrum, compensated for water vapour, and 
analysed by determining the area under the amide I band. 


30. Thomas, P.D.& Dill, K.A. An iterative method for extracting energy-like quantities 
from protein structures. Proc. Nat! Acad. Sci. USA 93, 11628-11633 (1996). 
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Structural basis for the inhibition of bacterial 


multidrug exporters 


Ryosuke Nakashima!, Keisuke Sakurai!, Seiji Yamasaki’, Katsuhiko Hayashi’, Chikahiro Nagata’, Kazuki Hoshino’, 


Yoshikuni Onodera®, Kunihiko Nishino* & Akihito Yamaguchi"? 


The multidrug efflux transporter AcrB and its homologues are 
important in the multidrug resistance of Gram-negative pathogens’. 
However, despite efforts to develop efflux inhibitors’, clinically useful 
inhibitors are not available at present*’. Pyridopyrimidine derivatives 
are AcrB- and MexB-specific inhibitors that do not inhibit MexY°’; 
MexB and MexY are principal multidrug exporters in Pseudomo- 
nas aeruginosa* °. We have previously determined the crystal struc- 
ture of AcrB in the absence and presence of antibiotics'**. Drugs 
were shown to be exported by a functionally rotating mechanism” 
through tandem proximal and distal multisite drug-binding pock- 
ets’’. Here we describe the first inhibitor-bound structures of AcrB 
and MexB, in which these proteins are bound by a pyridopyrimidine 
derivative. The pyridopyrimidine derivative binds tightly to a narrow 
pit composed of a phenylalanine cluster located in the distal pocket 
and sterically hinders the functional rotation. This pit is a hydro- 
phobic trap that branches off from the substrate-translocation chan- 
nel. Phe 178 is located at the edge of this trap in AcrB and MexB and 
contributes to the tight binding of the inhibitor molecule through a 
m-1 interaction with the pyridopyrimidine ring. The voluminous side 
chain of Trp 177 located at the corresponding position in MexY pre- 
vents inhibitor binding. The structure of the hydrophobic trap 
described in this study will contribute to the development of universal 
inhibitors of MexB and MexY in P. aeruginosa. 

Members of the resistance-nodulation-division (RND) family of 
transporters, which include AcrB, are tripartite complexes composed 
of an inner membrane transporter, a membrane fusion protein and an 
outer membrane channel’*"*. Among the RND transporters, AcrAB- 
TolC and its homologues (including MexAB-OprM and MexXY- 
OprM) are essential for intrinsic antibiotic tolerance and bacterial 
survival during infection’®’’. Most clinically isolated P. aeruginosa 
strains with multidrug-resistance phenotypes overexpress MexAB 
and/or MexXY"*. Over the last decade, various compounds have been 
randomly screened and subsequently modified to inhibit these efflux 
transporters*’’. Among them, two groups of effective inhibitors, the 
derivatives of Phe-Arg B-naphthylamide (PABN)'?”° and tert-butyl 
thiazolyl aminocarboxyl pyridopyrimidine®”, are the most commonly 
investigated. However, both groups of inhibitors are still in preclini- 
cal development*. Although the competitive inhibitor PABN exhibits 
a broad synergistic effect with antibiotics against multidrug-resistant 
P. aeruginosa "’, its toxicity is probably due to the induction of membrane 
dysfunction”””’. However, pyridopyrimidine derivatives do not cause 
membrane dysfunction and have almost no antibiotic activity. These 
derivatives are AcrAB-TolC- and MexAB-OprM-specific inhibitors®, 
but they do not inhibit MexXY-OprM, which is the main reason why 
pyridopyrimidine derivatives are not useful as clinical drugs to target 
multidrug-resistant P. aeruginosa. Among pyridopyrimidine derivatives, 
[[2-({[((3R)-1-{8-{[(4-tert-butyl-1,3-thiazol-2-yl)amino]carbonyl}- 
4-oxo-3-[(E)-2-(1H-tetrazol-5-yl)vinyl]-4H-pyrido|[1,2-a]pyrimidin- 
2-yl}piperidin-3-yl)oxy]carbonyl}amino) ethyl] (dimethyl) ammonio] 


acetate (D13-9001 (ref. 7), which we refer to as AcrAB/MexAB- 
specific inhibitor of pryridopyrimidine derivative (ABI-PP) in this 
manuscript (Supplementary Fig. 1), is an efficient inhibitor and has 
a relatively high water solubility’. Unlike PABN7', ABI-PP is hardly 
exported by efflux transporters (Supplementary Fig. 2). ABI-PP binds 
to purified AcrB and MexB proteins, but not to purified MexY protein 
(Supplementary Fig. 3b). The binding of a distal-binding drug to 
purified exporter proteins was competitively inhibited by ABI-PP, 
but the binding of a proximal-binding drug was not affected (Sup- 
plementary Fig. 4b), indicating that ABI-PP binds to the distal pocket. 
ABI-PP binding was not affected by either drug. However, ABI-PP 
potentiates the activities of a wide variety of drugs and toxic com- 
pounds including both distal- and proximal-binding drugs against 
MexAB-OprM-expressing cells, but fails to enhance the effects of 
these drugs against MexXY-OprM-expressing cells (Supplementary 
Fig. 5). The purpose of this study was to determine the structural basis 
of the inhibition of efflux transporters by determining the crystal 
structure of AcrB and MexB bound to the pyridopyrimidine derivative 
ABI-PP. Our work will provide a structural basis for designing a uni- 
versal inhibitor of multidrug efflux transporters. Although bound 
PABN was detected in the AcrB(N109A) mutant”, the physiological 
importance of the structure is not clear”’. The structures reported in 
this manuscript represent, to our knowledge, the first examples of 
physiologically important asymmetric structures of efflux transporters 
bound to inhibitors. 

We solved the crystal structure of AcrB with bound ABI-PP at a 
resolution of 3.05 A (Fig. 1a, b). ABI-PP binds to a narrow pit composed 
of a phenylalanine cluster in the distal pocket of the binding monomer 
(Fig. 1c and Supplementary Fig. 6a). The hydrophobic tert-butyl thia- 
zolyl aminocarboxyl pyridopyrimidine (TAP) moiety is surrounded by 
phenylalanine residues, including F136, F178, F610, F615 and F628. The 
pyridopyrimidine ring and thiazolyl ring strongly interact with F178 
and F628, respectively, through n-n stacking interactions. However, 
the tetrazole ring and the piperidine aceto-amino ethylene ammonio- 
acetate (PAEA) moiety interact with ionic and/or hydrophilic residues 
including N274, R620, Q176 and S180 in addition to aliphatic residues 
including 1277 and L177 (Fig. 1c). Figure 1d and e present cutaway views 
of the distal binding pockets with bound ABI-PP superimposed with 
bound minocycline and doxorubicin. Figure 1d provides a view towards 
the channel’s exit and Fig. le provides a view towards the entrance from 
the proximal pocket’. As shown in these figures, minocycline’” and 
doxorubicin™ are bound in the substrate-translocation channel from 
the entrance to the exit. The ethyl tetrazole group and the PAEA moiety 
of ABI-PP are also located in the substrate-translocation channel and 
almost overlap with the minocycline and doxorubicin molecules. Q176, 
L177 and 1277 interact with both doxorubicin and ABI-PP. N274, R620 
and $180 interact with both minocycline and ABI-PP. However, the 
TAP moiety is deeply inserted into a narrow phenylalanine-rich trap 
that branches off of the substrate-translocation channel. 


1Department of Cell Membrane Biology, Institute of Scientific and Industrial Research, Osaka University, Ibaraki, Osaka 567-0047, Japan. *Graduate School of Pharmaceutical Sciences, Osaka University, 
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Figure 1 | Crystal structure of the inhibitor-bound AcrB trimer. Three AcrB 
monomers are shown in blue, red and green to indicate the binding, extrusion 
and access monomers, respectively. The difference Fourier map (Finhipitor— 
Free) contoured at 3.50 (a and b) and 3.0 (c) is represented by an orange mesh. 
This colour scheme is used in all other figures. a, The side view of the entire 
structure of the ABI-PP-bound AcrB trimer. b, Cutaway view of the head-piece 


We also solved the crystal structures of MexB bound by ABI-PP at 
3.15 A (Fig. 2a, b) and drug-free MexB at 2.7 A (Supplementary Fig. 7). 
As illustrated in the close-up view (Fig. 2c and Supplementary Fig. 6b), 
the ABI-PP binding site in MexB is similar to the site in AcrB. The 
hydrophobic TAP moiety is inserted into the narrow hydrophobic trap 
surrounded by F136, F178, F610, F615, F628 and F573. The hydro- 
philic tetrazole ring interacts with D274, R620 and K151. The con- 
formations of these two ABI-PP moieties are almost the same as their 
conformations in AcrB. However, the conformation of the PAEA 
moiety in MexB is different from that in AcrB; specifically, this moiety 
extends towards R128 in MexB (Fig. 2c), but it is bent between the 
piperidine ring and the aceto-amino ethylene ammonio-acetate group 
in AcrB (Fig. 1c). As shown in Fig. 2d-f, the distal pocket is separated 
into two parts: a relatively hydrophilic substrate-translocation channel 
and a hydrophobic trap. The channel has sufficient space for multisite 
drug binding, and the trap forms a deep and narrow fissure. Our crystal 
structure of drug-free MexB (Supplementary Fig. 7) is largely identical 
to the structure determined at 3.0A in ref. 25. In our structure, the 
electron density corresponding to an entire molecule of n-dodecyl-B-p- 
maltoside (DDM) in the distal pocket is evident (Supplementary Fig. 8a) 
and the bound DDM partially overlaps with the bound ABI-PP in the 
narrow hydrophobic trap (Supplementary Fig. 8b, c). 

The crystal structure of MexY has not yet been solved. However, we 
constructed a homology model of MexY (using SWISS-MODEL”*) on 
the basis of the ABI-PP-bound AcrB/MexB structure. Figure 3 shows a 
close-up view of the ABI-PP-binding region. The most notable differ- 
ence between AcrB/MexB and MexY in this region is that the position 
of F178 in AcrB and MexB at the edge of the hydrophobic trap (Fig. 3a, 
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from the distal side of the cell. c, Close-up view of the ABI-PP-binding site. 
Carbon atoms of ABI-PP and amino acid residues are shown in yellow and 
green, respectively. d, e, Cutaway views of the distal drug-binding pocket 
towards the exit (d) and the entrance (e). Bound minocycline” (blue) and 
doxorubicin” (green) are superimposed. 


b) is occupied by W177 in MexY (Fig. 3c). The voluminous indolyl side 
chain hinders the binding of ABI-PP to this trap, which may be the 
reason why ABI-PP does not inhibit MexY. We constructed the 
AcrB(F178W) mutant and solved its structure (Fig. 3d and Sup- 
plementary Fig. 12a), which revealed that the indolyl side chain of 
W178 hindered ABI-PP binding, as in MexY. Therefore, to deter- 
mine the effect of steric hindrance at this position on the inhibitory 
activity, we constructed the mutants MexY(W177F), MexB(F178W) 
and AcrB(F178W) using site-directed mutagenesis. Mutagenesis did 
not affect the expression levels of any of the proteins (Supplementary 
Fig. 9). Although the AcrB(F178A) mutation causes a decrease in anti- 
biotic resistance’’, the F178W mutation in AcrB and MexB and the 
W177F mutation in MexY did not alter the antibiotic resistance pattern 
(Supplementary Table 2), suggesting that these mutations may not 
cause conformational changes in the substrate-translocation channel. 
The inhibitory effect of ABI-PP on efflux transporters was measured 
by examining its effect on cell growth and its ability to prevent the 
accumulation of doxorubicin in Escherichia coli cells (Fig. 4). In the 
absence of antibiotics, the growth rates of wild-type and mutant AcrB-, 
MexB- and MexY-expressing cells were similar to each other and were 
unaffected by the addition of ABI-PP (Supplementary Fig. 10). Wild- 
type E. coli cells are resistant to erythromycin, whereas acrB-deficient 
E. coli cells are highly sensitive to this drug (Fig. 4a), indicating that the 
intrinsic tolerance of E. coli to erythromycin is due to efflux by the 
AcrAB-TolC system. The quenching of doxorubicin fluorescence was 
prevented in wild-type E. coli cells by the AcrAB-TolC system, but the 
fluorescence in acrB-deficient and acrB/tolC-deficient cells was quenched 
due to the accumulation of doxorubicin in the cells (Fig. 4f). In wild-type 
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Figure 2 | Crystal structure of the inhibitor-bound MexB trimer. a, Entire 
structure of the ABI-PP-bound MexB trimer. b, Cutaway view of the head-piece 
of the ABI-PP-bound MexB. The F, - F. omit map is represented by an orange 
mesh, contoured at 3.50 (a and b). c, Close-up stereo view of the ABI-PP- 
binding site of MexB as shown in Fig. 1c, except that the electron density map of 
ABI-PP is calculated as an F, - F, omit map contoured at 3.00 because 


AcrB-expressing cells, cell growth was inhibited by ABI-PP in the pres- 
ence of erythromycin (Fig. 4b) and fluorescence quenching was increased 
by ABI-PP (Fig. 4g), indicating that AcrB was inhibited by ABI-PP. By 
contrast, in wild-type MexY-expressing cells, cell growth and fluor- 
escence quenching were not affected by ABI-PP (Fig. 4d, i), indicating 
that MexY was not inhibited by ABI-PP. The doxorubicin fluorescence in 
the MexY-expressing cells may have been somewhat quenched even in 
the absence of ABI-PP (Fig. 4i) because MexY is not able to completely 
prevent doxorubicin accumulation. The degree to which the growth of 
AcrB(F178W)-expressing cells was inhibited by ABI-PP was consid- 
erably decreased (Fig. 4c), and the quenching of doxorubicin fluorescence 
was completely prevented, even in the presence of ABI-PP (Fig. 4h), 
indicating that the AcrB(F178W) mutant is no longer inhibited by 
ABI-PP. By contrast, the growth of the MexY(W177F)-expressing cells 
was inhibited by ABI-PP (Fig. 4e), and the degree of quenching was 
increased by ABI-PP (Fig. 4)), indicating that the MexY(W177F) mutant 
is inhibited by ABI-PP. These results clearly indicate that the presence ofa 
voluminous side chain at the edge of the hydrophobic trap in MexY 
reduces the inhibitory activity of ABI-PP. 

However, in the case of MexB, the F178W mutation caused no 
change in the inhibitory activity of ABI-PP, as evidenced by the 
ABI-PP-induced inhibition of cell growth and the accumulation of 
doxorubicin (Supplementary Fig. 11). To reveal the reason for the 
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drug-free MexB binds a detergent molecule. d—f, Cutaway view of the distal 
drug-binding pocket of the ABI-PP-bound MexB towards the exit (green oval; 
d), looking down the hydrophobic trap (white oval) from the substrate- 
translocation channel (e) and towards the entrance (green oval; f). The 
hydrophobicity of the inner wall of the pocket is coloured according to 
Eisenberg’s hydrophobicity scale. 


difference between AcrB(F178W) and MexB(F178W) with respect to 
the response to ABI-PP, we determined the crystal structures of these 
mutants. The overall structures of AcrB(F178W) and MexB(F178W) 
were almost the same as the wild-type structures of AcrB and MexB, 
respectively. A close-up view of the hydrophobic trap showed that the 
indolyl side chain of W178 in the AcrB mutant protrudes into the 
narrow space of the hydrophobic trap (Supplementary Fig. 12a), 
whereas the indolyl side chain of W178 in the MexB mutant does not 
protrude towards the hydrophobic trap (Supplementary Fig. 12b), 
because that space is partially occupied by a DDM molecule, as in the 
wild-type MexB. We succeeded in determining the crystal structure of 
ABI-PP-bound MexB(F178W). ABI-PP was actually detected in the 
hydrophobic trap, as in the wild-type MexB (Supplementary Fig. 12c). 
The indolyl ring of W178 fits into a position that is capable of forming 
m-1 interactions with the pyridopyrimidine ring without large confor- 
mational change relative to the ABI-PP-free structure. By contrast, in the 
case of AcrB(F178W) and MexY, such conformation of the indolyl ring 
seems to be impossible because of the steric hindrance with V139 and 
1138, respectively (Supplementary Fig. 13). When these residues were 
replaced by alanine, the resulting mutants, AcrB(F178W/V 139A) and 
MexY(I138A), became inhibited by ABI-PP (Supplementary Fig. 14). 
These results clearly indicate that the sensitivity of exporters to ABI-PP 
depends on the space of the inhibitor-binding pit. 
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a AcrB 


b MexB 


Figure 3 | Comparison of the close-up views of the inhibitor-binding site of 
multidrug efflux transporters. The amino acid residues around this site are 
depicted using the space-filling model. The phenyl and indolyl side chains of 
F178 and W177 are shown in white. a, AcrB. b, MexB. c, Homology model of 
MexY overlapped with MexB-bound ABI-PP. The alignment of AcrB, 


In this study, we found that ABI-PP is deeply inserted into a hydro- 
phobic trap that branches off of the substrate-translocation channel in 
the distal binding pocket of AcrB/MexB. This trap is rich in phenyla- 
lanine residues, which do not directly interact with the exported drugs 
but have been shown to indirectly affect drug binding by site-directed 
mutagenesis” and molecular dynamics simulations”. ABI-PP is not 
exported by AcrB/MexB. ABI-PP binds tightly to this trap and inhi- 
bits the functional rotation mechanism’ of AcrB/MexB monomers. 
Therefore, it potentiates the activities of all of the antibiotics exported 
by AcrB/MexB not only distal-binding drugs but also proximal-binding 


a _ No plasmid b Wild-type AcrB 


© AcrB(F178W) 
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d_ AcrB(F178W) 


c¢ MexY 


MexB and MexY was generated by GENETYX (v9, GENETYX). The 
homology model of MexY was calculated using SWISS-MODEL”. 

d, AcrB(F178W) overlapped with AcrB-bound ABI-PP. In this panel, W178 is 
shown in white. 


drugs. By contrast, ABI-PP does not inhibit MexY because the volu- 
minous side chain of W177 is located in this trap. The inhibitor spe- 
cificity is determined by the small difference in the amount of space 
in the hydrophobic trap. Several classes of AcrB/MexB inhibitors 
have been developed but their mechanisms of action remain largely 
unknown. This study represents the first success, to our knowledge, 
in solving the inhibitor-bound structures of AcrB/MexB. The develop- 
ment of clinically useful inhibitors for multidrug efflux transporters is of 
key importance because such agents might make existing antibiotics 
more effective. This study provides detailed insight into the mechanism 
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Figure 4 | The inhibitory effect of ABI-PP. a, The growth of wild-type, acrB- 
deficient and acrB/tolC-deficient E. coli cells in the presence of 6 wg ml * 
erythromycin. b-e, The growth of E. coli cells expressing wild-type AcrB 

(b), AcrB(F178W) (c), wild-type MexY (d) and MexY(W177F) (e) in the 
presence of 6 pg ml’ (b, c) or 16 ug ml (d, e) erythromycin and various 
concentrations of ABI-PP (depicted as PP in panels). These tests were repeated 
three times. f-j, Quenching of doxorubicin fluorescence due to the 


accumulation in E. coli cells. f, The effect of acrB and/or tolC deletion. 
g-j, The effect of ABI-PP (28.6 1M) in E. coli cells expressing wild-type 
AcrB (g), AcrB(F178W) (h), wild-type MexY (i) and MexY(W177F) (j). 
The data and error bars correspond to the mean values of three 
independent experiments and the + standard deviation, respectively. a.u., 
arbitrary units. 
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by which the inhibitor ABI-PP acts on RND transporters, and our 
results will promote the structure-based development of universal inhi- 
bitors that inhibit both MexB and MexY. 


METHODS SUMMARY 


Crystallization and structural determination of inhibitor-bound AcrB and 
MexB. C2 crystals of ABI-PP-bound AcrB and drug-free AcrB(F178W) were 
generated (see Methods) using dodecanoyl sucrose and the structures were solved 
at resolutions of 3.05 and 3.60 A, respectively. P1 crystals of drug-free and ABI-PP- 
bound MexB and drug-free and ABI-PP-bound MexB(F178W) were generated 
(see Methods) using DDM, and the structures were solved at resolutions of 2.70, 
3.15, 3.30 and 3.00 A, respectively (Supplementary Table 1). 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


Preparation of membrane proteins. Histidine-tagged AcrB was expressed from 
the plasmid pAcBH in the E. coli strain JM109 and prepared as previously 
described'*. AcrB(F178W) was expressed from pAcBHF178W in the acrA/acrB- 
deficient E. coli strain W3104 and prepared using a method similar to that for wild- 
type AcrB. For MexB and MexB(F178W) overexpression, we used the plasmids 
pUCP20-Byis (provided by T. Nakae”’) and pUC118mexB F178W (pUC118 DNA, 
Takara Bio), respectively, and the protein was expressed with a carboxy-terminal 
polyhistidine tag in the wild-type and acrB-deficient E. coli strain MG1655, respect- 
ively. Bacteria were grown to an Asgo nm Of 0.7 at 37 °C and MexB expression was 
induced with 0.1 mM isopropyl-f-p-1-thiogalactopyranoside for 4 h. The cells were 
disrupted with a high-pressure homogenizer (APV 1000, SPX) equipped with a 
sample cooling unit at 180 MPa and the membrane fractions were collected by 
ultracentrifugation at 158,000g for 90 min. The purified membrane fractions were 
solubilized in buffer A (20 mM Tris-HCl (pH 7.5), 300 mM NaCl and 10% glycerol) 
containing 1.5% DDM (D316, Dojindo). The solubilized protein mixture was cen- 
trifuged for 1 h at 172,000g, and the supernatant was applied to a Ni Sepharose Fast 
Flow column (GE Healthcare) pre-equilibrated with buffer A containing 0.05% 
DDM. The column was washed using buffer A plus DDM containing 25mM 
imidazole (pH 7.5) and subsequently washed with 140 mM imidazole. MexB was 
eluted with 350 mM imidazole. Purified MexB was concentrated using an Amicon 
stirred cell (Model 8010, Millipore) and exchanged into buffer composed of 10 mM 
Tris-HCl (pH 7.5), 50 mM NaCl and 0.05% DDM with three concentration-dilution 
steps. Finally, the proteins were concentrated to approximately 30 mg ml’. 
Crystallization. Crystals of AcrB complexed with ABI-PP were prepared as 
previously described’ with an inhibitor:protein molar ratio of 5:1. Crystals of 
AcrB(F178W) were prepared in a manner similar to that of the wild-type protein. 
MexB crystals were grown at 25°C using the hanging drop vapour diffusion 
method. The protein solution (25mgml') was mixed with an equal volume of 
reservoir solution containing 50 mM Na acetate-HCl (pH 5.5), 300 mM NaCl and 
26% (v/v) polyethylene glycol 400 (conditions similar to those in ref. 25). Crystals 
appeared within 5days and were collected after a further 5 days. Crystals of 
MexB(F178W) were prepared using a method similar to that for the wild-type 
protein, except that n-octyl-B-p-thioglucoside was used as an additive. Crystals of 
MexB complexed with ABI-PP were prepared using a method similar to that for 
AcrB. Cryoprotection was achieved by increasing the polyethylene glycol 400 
concentration to 40% (v/v) in three steps. The crystals were flash frozen in liquid 
nitrogen or cryostream (100 K). 

Crystallographic analysis. Each data set was collected using the BL44XU beam- 
line at SPring-8 with an MX225-HE CCD detector (Rayonix) at 100 K. The dif- 
fraction data were processed and scaled using the HKL2000 (ref. 30) package. The 
crystals belonged to the C2 space group (AcrB and AcrB(F178W)) and the P1 
space group (free and inhibitor-bound MexB), and the cell dimensions are listed 
in Supplementary Table 1. Other crystallographic analysis procedures were per- 
formed as described previously’. 

Site-directed mutagenesis. The acrB gene was amplified from E. coli MG1655 
(ref. 31) genomic DNA using the Eco_acrB-F_Sall (5'-CGCGTCGACAGTCT 
TAACTTAAACAGGAGC-3') and Eco_acrB-R_SphI (5'-CGCGCATGCTCAA 
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TGGTGATGGTGATGATGATCGACAGTATGGCT-3’) primers, which intro- 
duce SalI and SphI restriction sites at each end of the amplified fragment (under- 
lined in the primer sequences above). This PCR product was cloned into the SalI 
and SphlI sites of the pBAD33 vector® to produce the pBADacrB plasmid. Point 
mutations were introduced using mutagenic primers to create the F178W 
(TTC-+TGG) and V139A (GTT-—+GCG) substitutions. The pMexAB"'M and 
pMexXY"'M plasmids were kindly provided by T. Nakae*’. Point mutations were 
introduced using mutagenic primers to create the MexB(F178W) (TTC-—TGG), 
MexY(W177F) (TGG-TTT) and MexY(I138A) (ATC-+GCG) substitutions. 
The constructed plasmids were sequenced using a 3100-Avant Genetic Analyzer 
(Applied Biosystems) to ensure the presence of the desired mutation. 
Erythromycin susceptibility tests in the presence of ABI-PP. Single colonies of 
E. coli AacrB/pBADacrB, AacrBAtolC/pMexAB"*M and AacrBAtolC/pMexXY"M 
strains were inoculated into 2 ml of Luria-Bertani (LB) broth. Cultures were grown 
overnight at 37 °C, and 2 X 10’ colony-forming units pl! bacteria were inoculated 
into 200 ul of LB broth containing erythromycin (6 jg ml’ for AcrB-expressing 
cells, 2 4gml~' for MexAB-OprM-expressing cells and 16 pgml~' for MexXY- 
OprM-expressing cells) in the presence of various concentrations of ABI-PP and 
10 mM arabinose. Bacterial growth was measured using 96-well plates and a Versa 
Max microplate reader (Molecular Devices). 

Doxorubicin efflux assay. Overnight cultures of E. coli were diluted in LB broth 
to achieve a cell density of 10’ cellsml~'. The cells were grown under aerobic 
conditions at 37°C in the presence of 10mM arabinose as an inducer until 
A600 nm = 0.3 was reached. The cells were then collected, washed twice with 
100 mM potassium phosphate buffer (pH 7.5 containing 5 mM MgSO,) and re- 
suspended in the same buffer to adjust the Agoo to 18. Each assay was performed in 
96-well black flat-bottom plates (Iwaki) using a final cell volume of 100 ul, and 
doxorubicin (Sigma-Aldrich) was added to a final concentration of 28.6 1M. For 
the inhibition assay, 28.6 1M ABI-PP was added. The fluorescence of doxorubicin 
was measured using an SH-8100 microplate reader (Corona Electric) with excita- 
tion set at 480 nm and emission at 600 nm. The fluorescence intensity decreased in 
a time-dependent manner owing to the accumulation of doxorubicin in the cells, 
and the decrease in fluorescence intensity was prevented when doxorubicin was 
expelled by an efflux pump. 
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Unusual base pairing during the decoding of a stop 


codon by the ribosome 


Israel S. Fernandez', Chyan Leong Ng't, Ann C. Kelley’, Guowei Ww’, Yi-Tao Yu? & V. Ramakrishnan! 


During normal translation, the binding of a release factor to one of 
the three stop codons (UGA, UAA or UAG) results in the termi- 
nation of protein synthesis. However, modification of the initial 
uridine to a pseudouridine (‘V) allows efficient recognition and read- 
through of these stop codons by a transfer RNA (tRNA), although it 
requires the formation of two normally forbidden purine—purine 
base pairs’. Here we determined the crystal structure at 3.1 A reso- 
lution of the 30S ribosomal subunit in complex with the anticodon 
stem loop of tRNA** bound to the WAG stop codon in the A site. 
The WA base pair at the first position is accompanied by the forma- 
tion of purine-purine base pairs at the second and third positions 
of the codon, which show an unusual Watson-Crick/Hoogsteen 
geometry. The structure shows a previously unsuspected ability of 
the ribosomal decoding centre to accommodate non-canonical base 
pairs. 

The genetic code normally requires Watson—Crick base pairing at 
the first two positions of the codon—anticodon helix, and tolerates cer- 
tain specific mismatches at the third (wobble) position. The structural 
basis for this was apparent when three universally conserved bases in 
the A site of the small (30S) ribosomal subunit were shown to change 
conformation to monitor Watson—Crick geometry at the first two base 
pairs in the minor groove of the codon-anticodon helix, while leaving 
the wobble position relatively unconstrained’. These interactions result 
in additional binding energy that is used to induce global conforma- 
tional changes that facilitate the hydrolysis of GTP by the elongation 
factor EF-Tu*”, as was predicted by earlier kinetic data®. However, 
recently it was shown that modification of the uridine in stop codons 
to Y (Fig. 1a) allowed ‘YAA and ‘PAG to code for serine or threonine 
and GA to code for phenylalanine or tyrosine’. 

To understand how the normally forbidden base pairs in such recod- 
ing can be accepted by the ribosome, we determined the crystal struc- 
tures of the anticodon stem loops (ASLs) of tRNAS* and tRNAPPE 
bound to the modified stop codons “AG and ‘PGA, respectively, in 
the A site of the 30S subunit, as well as the crystal structure of the most 
common bacterial tRNA** bound to WAG in the entire 70S ribosome. 

We first showed that nonsense suppression by ¥ also occurs in 
bacteria. We constructed three synthetic messenger RNAs encoding 
a 6X -histidine tag at the amino terminus and a Flag tag at the carboxy 
terminus (Fig. 1b, top). The three mRNAs contain either an amber 
stop (UAG), a glutamine sense (CAG) or a pseudouridylated amber 
(WAG) codon inserted just before the Flag tag. After in vitro trans- 
lation, anti-6%-histidine immunoblotting revealed that the three 
constructs are translated at the same level (Fig. 1b, bottom left). The 
anti-Flag immunoblot had a signal comparable to background when the 
normal UAG amber codon was present, showing that normal termina- 
tion occurred and the downstream Flag sequence was not translated 
(Fig. 1b, bottom right). However, the presence of the AG amber 
codon increased the signal from the Flag tag to a level comparable to 
that of the CAG sense codon, showing that the substitution of U by ¥ 
results in a strong read-through of the amber stop codon in bacteria, 
as previously reported in eukaryotes’. Given the conservation of the 
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c 
tRNAS* ASL 3’ 5’ 


tRNA 
36.35 34 


a 


YAG 


1 2 3 
mRNA 


Figure 1 | Chemical differences between uridine and pseudouridine, and 
experimental set-up. a, Uridine (U; 1-B-p-ribofuranosyluracil) and pseudouri- 
dine (‘Y; 5-B-p-ribofuranosyluracil). b, Top, diagram of the three synthetic mRNA 
constructs designed for the in vitro nonsense suppression experiment in bacteria. 
Bottom, anti-His and anti-Flag immunoblot analysis of the in vitro translation 
assays in Escherichia coli. c, The tRNA*" ASL and mRNA used in this study. 
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decoding centre, it is likely that the recoding also specifies the same 
amino acids as previously established in eukaryotes. 

As recoding and the decoding centre are both conserved across the 
two kingdoms, we determined the structure of the ASL for the yeast 
tRNA“, which has an IGA anticodon (http://gtrnadb.ucsc.edu/Sacc_cere/) 
(Fig. 1c) bound to a modified ‘¥AG stop codon in the 30S subunit, as 
previously described for cognate tRNA’ (Supplementary Methods and 
Supplementary Table 1). 

The codon and the ASL could clearly be seen in difference Fourier 
maps (Fig. 2a), allowing an unambiguous determination of the confor- 
mation of the bases involved (Fig. 2b). The overall conformations of the 
AG codon and ASL in the A site are very similar to those observed 
previously for phenylalanine-codon-ASL pairs in both the 30S subunit”* 
and the intact 70S ribosome’. However, the details of each of the codon- 
anticodon base pairs are markedly different, and reveal how the ribo- 
some facilitates the decoding of a normally non-cognate tRNA. 

At the first position, the 1 of the codon and A36 of the tRNA form 
the expected Watson—Crick base pair (Fig. 2b, left, and Fig. 3a). The N1 
of ¥ is exposed to solvent and does not form an additional interaction 
with either the ASL or the ribosome. The minor groove of this base pair 
is recognized by A1493 of the decoding centre (Fig. 3a). However, 
unlike in normal decoding (Fig. 3d), the electron density for A1493 
is consistent with a syn rather than an anti conformation (Supplemen- 
tary Fig. 1). 

In the second position, the A2 of the codon adopts a syn conforma- 
tion that forms two hydrogen bonds via its Hoogsteen edge® with the 
Watson-Crick edge of G35 of the ASL (Fig. 2b, centre, and Fig. 3b). 
Interestingly, the C1’-C1’ distance of 10.9 A in this non-canonical base 
pair is not much greater than the distance of 10.5 A found in a canon- 
ical pyrimidine—purine base pair with Watson-Crick geometry’. Pre- 
sumably this difference is small enough to allow the base pair to be 
accommodated in the decoding centre of the ribosome. Surprisingly, 
despite the completely different type of base pair, the conformation 


Codon-A, 
Base pair 1 


Figure 2 | Overall and detailed view of the base pairs involved in the codon- 
anticodon interaction. a, Overview of the 30S subunit and detail of the 30S 
decoding site with the ‘’AG-ASL-tRNA*™. The ASL of tRNA‘ is depicted in 
red, the WAG codon in yellow, the 16S rRNA in grey and the S12 protein in 
blue. Right, unbiased difference Fourier density for the ASL and codon. b, The 
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Base pair 2 


and interactions of the ribosomal bases A1492 and G530 are very simi- 
lar to those seen in normal decoding (Fig. 3b, e). 

Yeast tRNA** has an inosine in the third (wobble) position of the 
anticodon’. In suppression of a ‘PAG codon, the inosine would have to 
make a previously unknown I-G base pair, which would bring the two 
normally repulsive 06 groups close together. As in the second posi- 
tion, this unusual base pair is formed by a syn conformation of G3 
making a single hydrogen bond via its Hoogsteen face with the Watson- 
Crick edge of 134 of the ASL (Fig. 2b, right, and Fig. 3c). The potential 
electrostatic repulsion between the 06 atoms on G3 and 134 is over- 
come by coordination of both atoms with what is probably a magne- 
sium ion, thus stabilizing the base pair. 

To determine the generality of these interactions, we determined 
two additional structures. The first was that of bacterial tRNA** with a 
CGA anticodon, bound to mRNA in the entire 70S ribosome. Here, 
only the second base pair involves a purine-purine mismatch, which 
adopts the same Hoogsteen/Watson-Crick geometry as seen in the 
30S ribosome (Supplementary Table 1 and Supplementary Fig. 2a, c). 
We also determined the structure of the 30S subunit with a YGA 
codon bound to the ASL of RNA?” containing a GAA anticodon (Sup- 
plementary Table 1 and Supplementary Fig. 2b), as it was shown that 
YGA could be decoded by phenylalanine’. The same type of base 
pairing is seen, including the syn conformation of A1493 at the first 
position and the Hoogsteen/Watson-Crick interactions at the second 
and third positions. Because the third position here involves a GA pair 
instead of an IG, there are two hydrogen bonds and there is no coor- 
dinating magnesium ion. These structures show that the unusual 
Hoogsteen/Watson-Crick base pairs for purine-purine mismatches 
are seen for both bacterial and eukaryotic tRNAs implicated in the 
recoding of stop codons, and in the context of both 30S subunits and 
the 70S ribosome. 

The structure of RNA* ASL bound to the unmodified UAG codon 
in the 30S subunit (data not shown; Protein Data Bank (PDB) accession 


Codon-G, 


3g G-ASL 
Base pair 3 


base pairs for the codon-anticodon interaction are shown with boxes over the 
first (left), second (middle) and third (right) positions; the actual structures of 
the base pair is shown below in unbiased difference Fourier maps contoured at 
1.50. Subscripts indicate residue numbers. 
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Figure 3 | Interaction of ribosomal bases with the codon-anticodon base 
pairs. Details of base pairing for the first, second and third positions of the 
WAG-ASL-tRNA*" structure with the ribosomal bases involved in minor- 
groove recognition shown on the top (a-c). For comparison, the corresponding 


4K0K) was identical within coordinate error to that with the VAG 
codon. This suggests that any difference in the decoding properties 
must come from differences in the stability of the final complex due 
to 'Y, rather than differences in structure. 

This work shows how the normally forbidden codon-anticodon 
pairs that would be involved in recoding by ‘¥-containing stop codons 
are accommodated in the ribosomal decoding centre. 

At the first codon position, the syn conformation of A1493 facilitates 
a possibly tighter interaction with the minor groove of the Watson- 
Crick ‘VA pair (Fig. 3a) than the commonly seen anti conformation 
(Fig. 3d). The Watson—Crick/Hoogsteen base pairs at the second and 
third positions seen here have also been observed in canonical duplex 
DNA", showing that the energetic difference between canonical and 
Hoogsteen pairs is small enough for the latter to be observed as a tran- 
sient feature. However, during decoding, the ribosomal bases A1492 
and G530 were suggested to recognize specifically Watson-Crick- 
pairing geometry at the second position’ (Fig. 3e). Nevertheless, the 
same interactions can be formed with the Watson—Crick/Hoogsteen 
base pair described here (Fig. 3b), presumably because its width is 
similar and the interactions are with the 2'-OH groups of the riboses 
of the codon-anticodon base pair rather than with the bases them- 
selves. It is likely that these interactions are not as favourable as for a 
canonical Watson-Crick base pair. However, suppression of stop codons 
by tRNAs requires only that they are able to outcompete release factors; 
they do not have to be as efficient as cognate tRNAs on sense codons. 

It is not apparent why the ‘Y modification should facilitate binding 
of a non-cognate tRNA. The additional hydrogen-bond donor N1 of 
‘Y, which is not present in U (Fig. 1a), is exposed to solvent and does not 
make additional interactions with the ribosome or ligands. ‘¥ in RNA 
has been shown to result in an increased stabilization of helices with- 
out noticeable changes in structure'’’*. Understanding how V results 
in increased stabilization of helices will require both high-resolution 
data on model systems as well as computational and biochemical stu- 
dies. Determining the effect of Y on the rate constants of various steps 
in decoding’, or on efficiency of termination, will also help to clarify 
its role. 

In principle, modification of sense codons could also lead to alterna- 
tive forms of recognition. In this context it is interesting that a ‘V in the 
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positions of the codon-anticodon base pairs of a phenylalanine UUU codon 
with its cognate ASL anticodon from a previous study” are shown on the bottom 
(d-f). Subscripts indicate residue numbers. 


anticodon of a tRNA leads to altered coding of a sense codon”. Finally, 
our results show that the decoding centre of the ribosome has a previ- 
ously unsuspected plasticity that under certain circumstances allows 
alternative non-canonical base pairing, thereby allowing an expansion 
of the genetic code. 


METHODS SUMMARY 


A combination of in vitro transcribed and chemically synthesized RNAs were used 
to generate the three mRNA versions used in the in vitro translation assays in E. 
coli. Aliquots of the in vitro translation reactions were spotted onto nitrocellulose 
membranes and probed with monoclonal anti-His or anti-Flag antibodies’. Cry- 
stallization of 30S ribosomal subunits and their complex formation with tRNA 
ASLs and hexanucleotide mRNAs were done as previously reported’. An initial 
round of refinement with REFMAC" using only the 30S subunit or the empty 70S 
ribosome as starting models to avoid any model bias, clearly showed additional 
differences in density that could be attributable to mRNA and tRNA, or to ASL, as 
shown in Fig. 2. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


Bacterial in vitro translation assay. Reporter mRNAs were synthesized essen- 
tially as described previously’. Briefly, we generated a DNA template using overlap- 
extension PCR. The template thus generated contained a T7 promoter, followed 
sequentially by a Shine-Dalgarno (SD) sequence, a His6 tag, a termination codon 
(TAG) and a Flag tag (see Fig. 1), with the sequence: 
5'-GGGAGAGGAGGCAACGACATGGCCGCCCACCACCACCACCACCAC 
GCCGGCGCCGAACAGAAGGAAGAAGACCGCGAG(T/C/¥)AGGACTACA 
AGGACGACGACGACAAGGCCTAG-3’, 

In vitro transcription with T7 RNA polymerase was used to generate the repor- 
ter mRNA transcript containing an authentic termination codon (UAG) immedia- 
tely upstream of the Flag tag. Using the same strategy, we also generated a control 
mRNA transcript, where a CAG sense codon replaced the UAG codon. In addi- 
tion, oligonucleotide-mediated two-piece splint ligation was used to generate ano- 
ther mRNA containing a YAG codon in place of the UAG or CAG codon, as 
described previously’. The 5’ half RNA was in vitro transcribed, ending with GAG 
at its 3’ end. The 3’ half RNA was chemically synthesized, beginning with PAG at 
its 5’ end. The two RNA fragments were aligned through hybridization with a 
complementary bridging oligodeoxynucleotide, and ligated by T4 DNA ligase. 
Thus, all three mRNA transcripts have the same sequence, except for the codon 
immediately upstream of the Flag tag (UAG, CAG or PAG). 

Translation in bacterial cell lysate and dot-blot analysis of proteins. In vitro 
translation was carried out in a 37-1] reaction using E. coli cell lysate (Cosmo Bio 
Co) according to the manufacturer’s instructions. Immediately after the reaction, 
dot-blot analysis was performed as described’. Briefly, small aliquots of the reac- 
tion (2.5 pl and 2.5 ul of a fivefold dilution) were spotted onto the nitrocellulose 
membrane, and probed with a monoclonal anti-His antibody (H-3; Santa Cruz 
Biotechnology) or a monoclonal anti-Flag antibody (M2; Sigma-Aldrich). Goat 
anti-mouse IgG (H+L)-alkaline phosphatase conjugate (Bio-Rad) was then used 
as a secondary antibody. His and Flag signals were visualized using 1-Step NBT/ 
BCIP (Pierce). 

Crystallization. For 30S experiments, the ASLs of tRNAs and the hexanucleo- 
tide mimics of mRNA with the sequence 5’-PAGWAG-3’ (for ASL) or 5'- 
WGAYVGA-3’ (for ASL? ®) were chemically synthesized (Dharmacon). The ASLs 
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were designed with an extra G26—C44 base pair to improve stability. The tRNA 
was produced and purified as previously reported’®. 

Thermus thermophilus 30S ribosomal subunits were purified, crystallized and 

cryoprotected as previously described*’. In the 30S experiments, crystals were 
soaked for 48h in cryoprotection buffer containing 100 11M of the mRNA and 
100 1M of the ASL, and then flash frozen in liquid nitrogen. In the case of the ASL 
for tRNA?"* with a WAG codon, the addition of paromomycin to 100 uM in the 
cryoprotection buffer improved diffraction from about 3.7 A to 3.1 A, so that 
structure was solved in the presence of paromomycin. However, within the limits 
of resolution, no difference was seen in the conformation of the decoding centre. 
For the 70S experiment, a complex of the ribosome with mRNA, tRN AMet in the P 
site and tRNA* in the A site was crystallized as previously described’. The che- 
mically synthesized mRNA was identical to that previously used, except for a VAG 
codon in the A site. 
Data collection and refinement. Data were collected at the Diamond Light Source, 
beamline 104, integrated and scaled with XDS" and refined using REFMAC”. 
Visualization and model building was done with COOT”. An initial round of 
rigid body refinement using the published structure of the T. thermophilus 30S 
(PDB accession 1FJF) or the 70S ribosome (PDB accessions 2WH1 and 2WH2) 
was followed by restrained maximum-likelihood refinement. A final ‘jelly body’ 
refinement in REFMAC" further lowered the R factor and also significantly impro- 
ved the quality of the maps. The mRNA and ASL or tRNA were clearly seen in 
difference Fourier maps. The final model including the ligands and conforma- 
tional changes in the ribosome was refined as earlier. The density for a solvent 
atom adjacent to the 06 of G and I at the third base pair was assigned to a Mg** 
ion based on its coordination and the fact that a water molecule at this position 
refined to an abnormally low B factor. All figures were drawn using PYMOL”. 
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Research institutions in Saudi Arabia, the United Arab Emirates and Qatar are offering opportunities to foreign researchers. 


RELOCATING 


Middle Eastern promise 


Countries on the Arabian Peninsula are vying to attract young scientists to their universities. 


BY QUIRIN SCHIERMEIER 


hen Diego Cuadros told fellow 
scientists that he was moving to 
Qatar, they looked at him in dis- 


belief. What, they asked, did he hope to gain 
from doing research ina small Arab emirate, 
fabulously rich in oil and gas but with no note- 
worthy tradition in science? What ifhe became 
totally disconnected from the collaborations, 
colleague networks and scientific expertise 
that could facilitate his career success? 

But Cuadros, a Colombian-born epide- 
miologist who was finishing his PhD at the 
University of Kentucky in Lexington, knew 
what he was getting into. He had come across 
papers by Laith Abu-Raddad, a public-health 
researcher at Weill Cornell Medical College 
in Qatar (WCMC-Q) — the Doha branch of 
Weill Cornell Medical College in New York — 
and had become interested in Abu-Raddad’s 


research on diabetes and other illnesses. He 
also knew that his chances of getting funding 
in the United States after his PhD were slim. In 
2010, he asked about postdoctoral opportuni- 
ties at the WCMC-Q, and Abu-Raddad invited 
him for a two-month summer internship. 
Cuadros liked what he saw during his brief 
stay in Qatar — and, when Abu-Raddad later 
offered him a postdoctoral position, he packed 
his bags and returned to the Middle East. 
Nineteen months on, Cuadros is glad that he 
made the move. “I get all the support I could 
wish for and I can focus 100% on my malaria 
project,” he says, noting that the emirate pro- 
vides ample resources for research, travel, 
equipment procurement and housing. For the 
first time in his career, he is managing his own 
research money — a three-year, US$100,000 
grant from the Junior Scientists Research 
Experience Program of the Qatar National 
Research Fund. Thanks to generous travel 
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support from Abu-Raddad and the university, 
he can regularly attend conferences in Asia, 
Europe and North America. And he does not 
feel isolated in Doha. The capital’s Education 
City, a large science and technology compound 
launched in 1998 with government support, 
hosts branches of several US and British uni- 
versities, including Texas A&M University, 
Carnegie Mellon University and University 
College London. “I feel better-connected here 
than I did in the United States,” says Cuadros. 


KNOWLEDGE IS POWER 

Qatar is not the only aspiring science hub on 
the Arabian Peninsula. Aware that their oil 
wells will run dry sooner or later, and that a 
knowledge economy may be key to future 
prosperity, Saudi Arabia and the United Arab 
Emirates (UAE) have launched their own 
ambitious efforts to become well regarded in 
international science and higher education. > 
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> Saudi Arabia’s English-speaking King 
Abdullah University of Science and Technol- 
ogy (KAUST) in Thuwal, which graduated its 
first cohort of PhD students last December, 
prides itself on having several dozen leading 
Western researchers on its faculty. The UAE's 
New York University Abu Dhabi (NYUAD) 
attracts science students from around the 
world to its undergraduate courses (it has 
no official PhD programmes yet), and the 
Paris-Sorbonne University Abu Dhabi offers 
programmes in the humanities and social 
sciences. 

The past decade or so also has seen several 
futuristic science campuses rising among the 
sand dunes, complete with new labs, lucrative 
funding opportunities (Qatar spends around 
2.8% of its gross domestic product, or $1 bil- 
lion, on research and higher education each 
year, and KAUST has a $10-billion endow- 
ment) and generous pay that is generally on 
a par with US salaries. Money often comes 
directly from institutions, meaning that 
researchers spend little energy drumming up 
external grants. 

The region is replete with partnerships 
meant to draw in expertise and infrastructure 
from other countries. The Masdar Institute of 
Science and Technology in Abu Dhabi is col- 
laborating with the Massachusetts Institute 
of Technology in Cambridge on research into 
energy and sustainability, and into biofuel for 
aviation. Kourosh Salehi-Ashtiani, a systems 
biologist formerly at Harvard Medical School 
in Boston, Massachusetts, began his research 
there on algae metabolism with a grant from 
the US Department of Energy. He is now work- 
ing at the NYUAD, 
and collaborating 
with people at the 
University of Virginia 
in Charlottesville. 

However, institu- 
tions have plenty of 
catching up to do in 
terms of recruitment 
and amassing exper- 
tise. The NYUAD 


plans to substantially 

expanditsscienceand 

maths programmes Ifeel better - 

and to hire faculty connected here 
than I didin the 


members next year, 
after it moves to a spa- 
cious new campus on 
Saadiyat Island. Since 
January, KAUST has recruited in biosciences 
and materials sciences. 

A stint at a lab in the Gulf can be a valuable 
addition to a Western scientist’s CV, hint- 
ing at personal flexibility, open-mindedness 
and a willingness to take on a certain level of 
risk. But anyone considering the move must 
remember that the scientific environment 
is still new, says Khaled Machaca, associate 
dean for research at the WCMC-Q. Although 


United States.” 
Diego Cuadros 


funding opportunities are ample and new talent 
is welcome, foreign researchers will need to be 
patient about under-developed infrastructure. 
When Salehi-Ashtiani arrived at the NYUAD 
in February 2011, he found that his laboratory 
was nothing but empty benches. He says it took 
several weeks to get to know local suppliers for 
the most urgently needed reagents and pieces of 
equipment — from high-throughput sequenc- 
ers to photo bioreactors and microarrays. 

“Dont expect to find what you're used to,” 
says Machaca. For all their modern conveni- 
ences and regional charms, Doha and Abu 
Dhabi have little to offer in the way of nightlife 
such as concerts, clubs or trendy bars. And the 
local institutions are too new and unproven to 
have the reputation or traditions of the Univer- 
sity of Oxford or Harvard. “If you're looking for 
a research environment identical to the one you 
leave behind, the Gulf region is probably not 
what you're after,’ continues Machaca. “But if 
you have a bent for adventure and thrive on the 
new and exciting, it might be exactly the right 
place for you.” Teething troubles, he says, are 
not unusual in a barely ten-year-old campus on 
the periphery of the scientific world. But gov- 
ernment bureaucracy and research administra- 
tors are getting more experienced and efficient, 
and productive research is possible. 


RECRUITMENT DRIVE 

In 2009, Abu-Raddad left a tenure-track posi- 
tion at the Fred Hutchinson Cancer Research 
Centre in Seattle, Washington, and moved to 
Qatar, eager to be closer to his native Jordan 
and tempted by the no-strings-attached fund- 
ing at the WCMC-Q. “I gave myself one year to 
try out how things might work here,’ he says. 
His expectations were easily surpassed: he won 
$6 million in grants for hepatitis and HIV pro- 
jects from the Qatar National Research Fund, 
and hired six research specialists (temporary 
workers who are typically locally trained grad- 
uate students) whom he found to be exception- 
ally gifted. He decided to stay. 

Recruiting high-calibre postdocs was a chal- 
lenge at first, he says. Many baulked at a stint in 
the Middle East. But most faculty members at 
KAUST and branch universities in the region 
are now being recruited from abroad, and 
early-career researchers have followed, keen to 
take advantage of the funding — and incentives 
such as free or inexpensive on-campus hous- 
ing, free public transport and zero taxes. In 
the past few years, Abu-Raddad has recruited 
postdocs from Colombia, Japan, Hong Kong 
and the United States. In all, there are currently 
around 50 foreign postdocs at the WCMC-Q, 
up from 5 in 2009. And the trends are similar 
elsewhere: in the first half of this year, KAUST 
recruited 15 assistant, associate and full profes- 
sors from abroad. 

The NYUAD’s planned campus move will 
create more than 200 faculty and postdoc 
positions at all levels of experience, says Pance 
Naumov, a chemist who moved from Japan to 
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Abu Dhabi last year. Salehi-Ashtiani says that 
when he advertised a job five years ago, only 
a handful of people applied. Now he receives 
several speculative applications per week. 
Abu-Raddad says that he has chosen his 
postdocs carefully: anyone who thinks that 
a research stay in the Middle East is a nice, 
relaxed way to make 
a good salary and 
polish their CV 
is mistaken. “I’m 
expecting my group 
members to be just 
as hard-driven as 
they would need to 
be to pursue a career 
in the United States, 
Europe or Japan,’ he 
says. “That research 
is still relatively new 


“Getting here doesn’t mean 
something that things are more 
started from easy-going. Getting 
the ground up something started 
requires self - from the ground 


motivation.” 
Laith Abu-Raddad 


up requires, in fact, 
extra stamina and 
self-motivation” 

Those who do venture to the Middle East 
have plenty to gain. Doha and Abu Dhabi — like 
neighbouring Dubai — have large expatriate 
populations, and so have acquired a multicul- 
tural identity that makes it relatively easy for 
foreigners to settle. English is spoken fluently, 
crime is low and hanging out in restaurants or 
cafes or on the beach on a Saturday night is com- 
monplace. Religiously ultra-conservative Saudi 
Arabia is far more restrictive in terms of gender 
equality and in some other regards — alcohol is 
forbidden, for example (it is legal but restricted 
in the UAE and Qatar). However, KAUST oper- 
ates as a mixed-gender campus and there are no 
Saudi religious police on its grounds. Women 
may mingle with men and drive cars while on 
campus, and need not wear Islamic dress unless 
they leave the grounds, at which point they must 
dress modestly. 

Women’s rights are also restricted in Qatar, 
where men enjoy privileged status under fed- 
eral law, and to a lesser degree in the UAE. 
Cuadros advises anyone considering relocat- 
ing to the Gulf to go on an exploratory trip or 
undertake an on-site internship first. His wife, 
plant physiologist Maria Torres, started post- 
doctoral research on the date genome at the 
WCMC-Q in June; she had high hopes, but was 
also nervous. “When I first arrived in Qatar, I 
was prepared to find a place that would chal- 
lenge my cultural boundaries,” she says. But 
she has seen a mixture of cultures, languages 
and opinions in the lab. “Surprisingly, my 
experience has not been that removed from 
my life as a PhD student in the United States? m 


Quirin Schiermeier is Nature’ Germany 
correspondent. 
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Too much hype 


Scientists have to promote their work. But they should 
fight the pull to oversell it, says Monika Maleszewska. 


makes a scientist, I would have said curi- 

osity. Now, after almost three years pursu- 
ing my PhD, I would probably say political 
skills. Genuine curiosity does indeed make 
a good scientist, but the ability to promote 
one’s work makes a successful one. 

No matter how driven they are, research- 
ers need more than expertise and bright 
ideas: they need money. Young scientists 
seeking funding must be ready to enter a 
world for which their degrees have not pre- 
pared them — a world of administrative and 
funding-agency politics, in which they must 
promote their ideas to gain attention and 
receive grants. But they also must take care 
to avoid crossing the line between promo- 
tion and hype. 

In the competitive and expensive world of 
modern science, researchers cannot afford 
to toil away on their own. Lone-wolf scien- 
tists might have their own vision of inno- 
vative, cutting-edge research that will reap 
rewards. But they will probably struggle to 
procure enough funding to do that work. A 
hybrid approach might be to secure money 
through grant applications for ‘fashionable’ 
work (with a pinch of hype where necessary), 
and to hope that the resulting funding will, 
somewhere along the way, let the visionary 
scientist pursue his or her dream project — 
the one that really has an impact. 

As a young scientist learning to navi- 
gate these issues, I often hear the following 
advice: communicate more effectively. If 
your project is in basic science or is difficult 
to understand, people say, make it simpler. 
Nicer. Easier to digest. Yet scientists thrive 
on precision. So sometimes, when pressed to 
make our projects sound simpler and more 
attractive, we choose hype as an easy way out. 

Fashionable keywords, which change 
almost seasonally, help our projects to sound 
more relevant to the current trends. Society 
expects science to have applications, so we 
readily slip in some socially relevant perspec- 
tive. A bit of exaggeration about expected 
results or future uses does not bother our 
consciences, as long as we perceive it as 
unbridled enthusiasm. 

A skilful presentation and a positive atti- 
tude can make a huge difference in how a 
scientist’s work is perceived. Give two PhD 
students the same set of data to present, and 


E you had asked me a few years ago what 


one may put the audience to sleep with dry 
delivery, whereas the other might spark a 
vigorous discussion, perhaps winning a 
collaborator. 

Yet despite being under constant pressure 
as we climb the ranks of academia, scien- 
tists must learn to navigate the blurred line 
between hype and savvy promotion. Young 
researchers who frequently exaggerate the 
implications of their findings or make hasty 
conclusions risk harming their reputations 
and losing the trust of their colleagues. 

I often wish that scientists had the luxury 
of being able to do basic research just because 
it is interesting. But reviewers sometimes 
gravitate towards the projects that provide 
direct solutions to burning problems, rather 
than to basic projects with no clear applica- 
tions. We must give basic projects a chance, 
especially because breakthroughs are hard 
to anticipate. That basic science might be 
closer to a meaningful application than any- 
one expects. 

In the competitive world of scientific fund- 
ing, researchers often have no choice but to 
hone their political skills and manage public 
relations for their research. Ideally, they will 
be able to do this without taking too much 
time away from the science. What’s clear is 
that budding researchers must learn how 
to promote their work, and perhaps even 
become trendsetters — without resorting 
to hype. = 


Monika Maleszewska is a graduate student 
in regenerative medicine at the University of 
Groningen in the Netherlands. 
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PUBLISHING 
Fashion rules in physics 


Physicists are fad-followers who often 
pursue the fields with the highest number 
of recently published papers, says a study 
(T. Wei et al. Sci. Rep. 3, 2207; 2013). The 
metareview, of around 320,000 articles 
published by the American Physical 
Society’s Physical Review journals from 
1976 to 2009, also found that papers with 
many authors, which are common in some 
areas of physics, tend to be on ‘hot’ research 
topics. Working in a hot field can be a good 
strategy, says lead author Jinshan Wu, a 
physicist at Beijing Normal University, 
because it can increase citations, attention 
from peers and research funding. But, he 
notes, the scientific community can suffer 
if leaders are not exploring new directions. 


EUROPEAN UNION 


Grant popularity soars 


A 50% spike in demand for European 
Research Council (ERC) Starting Grants 
has pulled this year’s success rate down to 
9%, and is likely to lead to changes in some 
eligibility requirements. ERC president 
Helga Nowotny says that the increase — 

to 3,329 applications this year, of which 
287 were successful — is attributable 

in part to research-budget cuts in some 
European nations. The ERC, based in 
Brussels, will consider lengthening the 
period between resubmissions to reduce 
the flow of applications, she adds. Starting 
Grants, now in their sixth year, fund 
researchers who finished their doctorate in 
the past 2-7 years. This year, the ERC spent 
nearly €400 million (US$527 million) 

on the awards, which are worth up to 

€2 million each and last for up to five years. 


FUND-RAISING 
Art sales fund science 


Sales of stylized scientific images, including 
micrographs of human-heart and mouse- 
brain cells, are helping to pay for early- 
career researchers to travel. Proceeds from 
an art fair and website (go.nature.com/ 
aakuwa) have raised some US$30,000 so 
far, enough to send 60 graduate students 
and postdocs from the University of 
Michigan in Ann Arbor to conferences. 
And there is another benefit: “We started 
this as a fund-raiser, but it turns out to 

be an awesome public-outreach tool,” 

says Deborah Gumucio, founder of the 
Michigan Center for Organogenesis, which 
runs the Bio-Artography project. “People 
come into our fair booth and we can talk to 
them about pluripotent stem cells” 
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Uae SCIENCE FICTION 


THE BEST OF US 


BY LEE HALLISON 


ind howled over blackened trees 
as Ginja picked her way to me 
over broken highway lumps. 


A tough cookie, the kind that sometimes 
broke. Hard to judge on the first day. On the 
other hand, soft ones didn’t even get to try. 
Like the boy Id refused yesterday? He had no 
idea the favour I'd done his sorry ass. 

I train recruits for scavenger duty. If this 
one passed the first-day test, she was in luck. 
I hadn't always been a teacher, but I'd be god- 
damned if I wasn't a good one. 

“Ears hurt,’ Ginja complained when she 
reached the bus. The wind out here was 
painful, a never-ending screech, as if Earth 
itself was moaning. The nearly useless hel- 
mets we wore had way-too-thin ear pads and 
faceplates that barely protected us from the 
sharp debris and swirling dust. 

Faded lettering across the tilted bus 
announced “Panorama Tours!” Ginja set 
her half-filled sack by the left wheel. The bus 
seemed stable. Two wheels on the cracked 
asphalt, two in a long furrow in the bare 
median strip. 

I swung my rifle butt into the Grey- 
hound’s door. The glass shattered. I reached 
in and yanked the inner handle. A musty- 
sour smell wafted over us as I opened the 
door. 

The wind dropped as I climbed in. Ginja 
stepped up. This was the moment when 
some would-be scavengers freaked out. 

“Look at all this stuff!” 

Hand luggage was strewn in the aisle. Skel- 
etons draped with decayed clothing slumped 
in the seats. 

“Yes, a good haul.” Shed do. I shifted my 
rifle to the side to let her by. 

Vehicles sometimes hosted rats or feral 
dogs. This bus seemed empty, so I pointed 
the rifle down — but stayed ready. I wasn’t 
about to break my track record. My trainees 
came back. 

“Stinks!” Ginja pulled her faceplate back 
down. She reached over the driver's headless 
skeleton and yanked the window open. 

“Jeez, shut it! Are you stupid?” I yelled. She 
pushed, but the frame stuck. Now we were 
ina mini-windstorm inside a bus filled with 
dead people's dust. Maybe I was mistaken. 
Tough was good but smart was better. 

I forced the window closed. 

“Ask me before you do shit. Wind comes 
from this side. If you need air, open one on 
the lee side” 


116 | NATURE | VOL 500 | 1 AUGUST 20 


It’s no game. 


She nodded, worry that shed blown it 
written on her face. I ignored her dismay 
and told her what to search for. I'd stay up 
front as guard. 

“Batteries, books, all electronics and toys,” 
I said. “Any medicines or creams, even open 
tubes. Clothing if it looks strong, check for 
quality — thin or torn-up stuff ain't worth 


saving. Pass on food. It won't be good any- 
more.” 

“Even cans?” She unzipped a large ath- 
letic-style bag. 

“Nobody brought food cans on tours. Use 
your brains.” 

She picked up a skull that was resting on 
a purse snuggled between two bony knees. 
Before I could stop her, she opened a lee-side 
window and tossed it out. 

“Hey! No desecrating the dead. That's 
important” 

“Surviving is important. Finding stuff 
to haul back is important. Where does giv- 
ing a shit fit in?” Ginja stood, skinny arms 
akimbo, chin out. 

“Being human, that’s where” 

She snorted but didn’t throw away any 
more body parts. She worked quickly, 
tossing a mobile phone and a cosmetics 


bag into the growing 

pile in the aisle, then NATURE.COM 
pulledablack leather — Follow Futures: 
duffel from under- © @NatureFutures 
neath the seat. EF go.nature.com/mtoodm 
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Asmall “oh” drew my attention away from 
the windows. 

Ginja held up a dirty white ball. 

“Soccer.” 

“Yeah, so?” 

“My brother played soccer. That could 
have been him.” She looked at the seated 
skeleton. 

Istayed quiet, letting her chew on the idea. 

She set the ball on the pile. Gently, though, 
and her hand lingered. I looked out front, 
giving her space. I wondered, idly curious, if 
she would break this soon. 

She coughed and pulled out a shopping 
bag, dumping the papers it held. I heard a 
scritch from the back and turned to see a fat 
furry shape streak over a seatback. 

“Rat! Get behind me!” 

Ginja jumped up and we swivelled around 
each other in the narrow aisle. The rats were 
fearless, vicious and carried disease our mea- 
gre antibiotics couldn't fight. I aimed the rifle 
and waited. Soon enough, it poked a head up 
and stared at me. I let it nose up higher and 
squeezed the trigger. 

Ginja yelled at the echoing boom and we 
both fell back a bit. I'd seen the blood spray. 
I kept the gun aimed towards the back and 
motioned to her pile. 

“Grab it, we'd better get out. Gunshot 
might bring the curious.” 

Her face was ashen. She stepped around 
me, squatted down and scooped her finds 
into the shopping bag. The soccer ball rolled 
slowly away and she looked up at me. 

“Yeah, sure, get it.” 

She duck-walked towards the ball and 
grabbed it before it got too far. She squeezed 
it to her chest, her lip trembling. She jabbed 
her chin at the disarrayed skeletons. 

“All these people. Oh, jeez. They’re all 
people!” 

Icocked my head. 

“What did you think they were?” 

She shuddered, stuck the ball in the bag 
and stood up. 

“Stupid, I know.” 

“No, youre not stupid. It’s hard to care and 
it’s hard not to care” 

As she reached the front steps, she turned. 

“Cap?” She blinked as if something were 
in her eyes. “Do we have time to bury them?” 

We didn't, but I was pleased. The best of us 
knew just exactly what we were doing. 

I passed her, of course. m 


Lee Hallison is a writer living in the Pacific 
Northwest. She blogs at leehallison.com. 
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